Meta · 400B · mixture-of-experts

2025-04-05 · 1.0M context · 400B params

Use Cases

chat code reasoning multilingual vision math tools writing summary

Quantization Options

Quant          Bits  VRAM      Quality    Status
Q4_K_M (rec)   4     228.0 GB  Good
Q8_0           8     410.0 GB  Excellent

About this model

Llama 4 Maverick is Meta's flagship mixture-of-experts model, with 400B total parameters spread across 128 experts and 17B active per token. It natively accepts text and image inputs, offers a 1M-token context window, and performs strongly on reasoning, coding, and multilingual tasks. Maverick is open-weight and delivers results competitive with top proprietary models. Its large expert count enables broad knowledge coverage, but because any expert may be selected for a given token, all 400B parameters must be loaded into memory for inference even though only 17B are active per token.
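The memory cost of loading every expert can be estimated with simple arithmetic. The sketch below is a back-of-the-envelope calculation, not the method used to produce the VRAM figures on this page. It assumes the nominal bit widths from the quantization table (4 and 8 bits per weight) and counts weights only. The function name `weight_memory_gb` is hypothetical and chosen for illustration.

```python
def weight_memory_gb(total_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal GB (1e9 bytes)."""
    return total_params * bits_per_weight / 8 / 1e9


TOTAL_PARAMS = 400e9   # total parameters, as stated on this page
ACTIVE_PARAMS = 17e9   # parameters active per token, as stated on this page

# Nominal bit widths only (an assumption): real quantization formats store
# extra per-block metadata, so these numbers are rough lower bounds.
for name, bits in [("Q4_K_M", 4), ("Q8_0", 8)]:
    print(f"{name}: ~{weight_memory_gb(TOTAL_PARAMS, bits):.0f} GB for weights")

# Share of the model that does work for any single token.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"active per token: {active_fraction:.2%} of all parameters")
```

The nominal estimates (about 200 GB and 400 GB) fall below the 228.0 GB and 410.0 GB listed for the two quantizations. The difference is plausibly due to format overhead and runtime buffers, though this page does not say how its figures were derived. The active fraction shows why compute per token is modest while the memory footprint is not: only about 4% of the parameters are used per token, yet all of them must be resident.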

Benchmarks

MMLU: 82.0