Gemma 3n E2B
by Google · gemma-3 family
Tags: text-generation, reasoning, multilingual, vision
Gemma 3n E2B is Google's ultra-compact 2B-parameter multimodal model from the Gemma 3n family, optimized for edge and on-device inference. It handles both text and vision inputs while requiring minimal memory and compute resources. As the smallest model in the Gemma 3n lineup, the E2B variant is ideal for resource-constrained environments where a lightweight model with multimodal capabilities is needed, such as mobile devices and IoT applications.
Quick Start with Ollama
```shell
ollama run e2b-q4_K_M
```

| Spec | Value |
|---|---|
| Creator | Google |
| Parameters | 2B |
| Architecture | transformer-decoder |
| Context | 32K tokens |
| Released | Sep 10, 2025 |
| License | Gemma Terms of Use |
| Ollama | gemma3n:e2b |
Quantization Options
| Format | File Size | VRAM Required | Quality | Ollama Tag |
|---|---|---|---|---|
| Q4_K_M (recommended) | 1.3 GB | 3.3 GB | | e2b-q4_K_M |
| Q8_0 | 2.2 GB | 4.2 GB | | e2b-q8_0 |
| F16 | 4.0 GB | 6.5 GB | | e2b-fp16 |
Compatible Hardware
Running the recommended Q4_K_M quantization requires approximately 3.3 GB of VRAM; see the quantization table above for the requirements of the other formats.