Gemma 4 E4B

Apache 2.0

Google · 4B · transformer-decoder

2026-04-02 · 131K context · 4B params

Use Cases

chat · code · reasoning · multilingual · vision

Quantization Options

Quant          Bits   VRAM      Quality     Status
Q4_K_M (rec)   4      6.0 GB    Good
Q8_0           8      10.0 GB   Excellent

About this model

Gemma 4 E4B is Google's efficient multimodal model with about 4.5B effective parameters, aimed at laptops and consumer devices. It delivers a significant quality improvement over its predecessor, Gemma 3n E4B, while keeping a similar memory footprint. It supports text and vision inputs with a 128K-token context window (131,072 tokens, listed as 131K in the header). At Q4_K_M it needs about 6 GB of VRAM, which fits on entry-level GPUs and 8 GB Macs.
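As a rough illustration of how the VRAM figures above could guide a choice of quantization, here is a minimal Python sketch. The VRAM numbers are copied from the Quantization Options table. The function name `pick_quant` and the `headroom_gb` parameter are hypothetical helpers introduced for this example, not part of any Gemma or runtime API, and the figures should be read as estimates rather than guarantees.

```python
# VRAM figures (GB) copied from the Quantization Options table on this page.
QUANT_VRAM_GB = {
    "Q4_K_M": 6.0,   # marked "rec" in the table
    "Q8_0": 10.0,
}


def pick_quant(available_gb, headroom_gb=0.0):
    """Return the listed quant with the largest VRAM figure that still fits.

    headroom_gb is a hypothetical safety margin for memory used by the
    OS, display, or other applications. Returns None if nothing fits.
    """
    budget = available_gb - headroom_gb
    fitting = [
        (vram, name) for name, vram in QUANT_VRAM_GB.items() if vram <= budget
    ]
    if not fitting:
        return None
    # max() compares the VRAM figure first, so the larger quant wins.
    return max(fitting)[1]


if __name__ == "__main__":
    for gb in (4, 8, 12):
        print(f"{gb} GB available -> {pick_quant(gb)}")
```

On machines with unified memory, such as 8 GB Macs, the full amount is usually not available to the model, so passing a nonzero `headroom_gb` may give a more realistic answer.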