Skip to content

Gemma 4 26B

Apache 2.0

Google · 26B · transformer-decoder

2026-04-02 262K context 26B params

Use Cases

chat code reasoning multilingual vision tools math

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec420.0 GBGood
Q8_0830.0 GBExcellent

About this model

Gemma 4 26B is a Mixture-of-Experts model with 26B total parameters but only 3.8B active per token, giving it exceptional efficiency. It ranks #6 on Arena AI among open models and scores 88.3% on AIME 2026 — remarkable for its active parameter count. The MoE architecture means it fits in ~20 GB VRAM at Q4 while delivering reasoning quality that rivals much larger dense models. Supports 256K context with native vision and tool use.

Benchmarks

88.3
aime2026
77.1
livecodebench