rally-4b

Browser-oriented ONNX export of a Gemma 4 Heretic checkpoint packaged for WebGPU / Transformers.js.

Capabilities

  • Supported inputs: text, image
  • Not included in this package: audio, video

Version Notes

  • This is the stable v1 browser package.
  • For the enhanced multimodal variant, see thomasjvu/rally-4b-v2.
  • The v2 package adds audio and video input support for this model family.

Provenance

  • Source model: coder3101/gemma-4-E4B-it-heretic
  • Base model for inherited processor assets: google/gemma-4-E4B-it
  • Architecture family: gemma4_conditional_generation
  • Expected architecture: Gemma4ForConditionalGeneration
  • Target dtype: q4f16
  • Target device: webgpu

Expected ONNX Sessions

  • vision_encoder_q4f16.onnx
  • embed_tokens_q4f16.onnx
  • decoder_model_merged_q4f16.onnx
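
The file names above follow a session-name plus dtype-suffix pattern, so a quick completeness check against a repository file listing can be sketched as below. This is a minimal illustration, not part of the package: the helper names `expectedOnnxFiles` and `missingOnnxFiles` are hypothetical, and the `onnx/` subfolder is an assumption based on the usual Transformers.js repository layout.

```typescript
// Session names and dtype are taken from this model card.
const SESSIONS = ["vision_encoder", "embed_tokens", "decoder_model_merged"] as const;
const DTYPE = "q4f16";

// Hypothetical helper: builds the expected file paths.
// Assumption: ONNX files live in an "onnx/" subfolder of the repo.
function expectedOnnxFiles(subfolder: string = "onnx"): string[] {
  return SESSIONS.map((name) => `${subfolder}/${name}_${DTYPE}.onnx`);
}

// Hypothetical helper: returns the expected files that are absent
// from a given repository file listing.
function missingOnnxFiles(repoFiles: string[], subfolder: string = "onnx"): string[] {
  const present = new Set(repoFiles);
  return expectedOnnxFiles(subfolder).filter((file) => !present.has(file));
}
```

An empty result from `missingOnnxFiles` suggests all three sessions are present under the assumed layout; it says nothing about whether the files themselves are valid.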

Usage

Load this repo with Transformers.js on the WebGPU backend, using the transformers.js_config metadata shipped with the model.
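
As a rough sketch of the loading step, the snippet below prepares the options that would accompany a Transformers.js `from_pretrained` call. The helper names `detectWebGpu` and `buildLoadOptions` are hypothetical, and the snippet deliberately leaves out the library import and the model class, since this card does not state which class to load. The `dtype` and `device` values mirror the Provenance section; when the repo's transformers.js_config already carries them, passing them explicitly is likely redundant.

```typescript
// Repo id and target values are taken from this model card.
const MODEL_ID = "thomasjvu/rally-4b";

interface LoadOptions {
  dtype: string;
  device: string;
}

// Hypothetical helper: reports whether the runtime exposes navigator.gpu,
// the entry point of the WebGPU API.
function detectWebGpu(): boolean {
  const nav = (globalThis as unknown as { navigator?: { gpu?: unknown } }).navigator;
  return nav !== undefined && nav.gpu !== undefined;
}

// Hypothetical helper: returns the options for this package, or throws
// when WebGPU is unavailable, because the package targets webgpu only.
function buildLoadOptions(hasWebGpu: boolean): LoadOptions {
  if (!hasWebGpu) {
    throw new Error(`WebGPU is not available; ${MODEL_ID} targets the webgpu device.`);
  }
  return { dtype: "q4f16", device: "webgpu" };
}

// Usage in a page that imports Transformers.js (not executed here):
//   const options = buildLoadOptions(detectWebGpu());
//   ...pass MODEL_ID and options to the chosen class's from_pretrained call.
```

Failing early when WebGPU is missing is a design choice of this sketch; it avoids starting a multi-gigabyte download that the browser cannot run.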
