Skip to content

Gemma 4 E2B

Apache 2.0

Google · 2B · transformer-decoder

2026-04-02 131K context 2B params

Use Cases

chat reasoning multilingual vision

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec44.0 GBGood
Q8_086.0 GBExcellent

About this model

Gemma 4 E2B is Google's smallest Gemma 4 model with 2.3B effective parameters, designed for edge and mobile deployment. Despite its compact size, it supports vision input and 140+ languages with a 128K context window. At Q4 it needs just 4 GB of VRAM, making it one of the most capable models you can run on virtually any hardware — including phones and Raspberry Pi devices.