Mixtral 8x22B
Apache 2.0 · Mistral AI · 141B · mixture-of-experts
2024-04-17 · 64K context
141B params
Use Cases
chat code reasoning multilingual math tools
About this model
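Mixtral 8x22B is Mistral AI's large sparse mixture-of-experts model, with 141 billion total parameters. Each layer contains 8 experts, and a router selects 2 of them per token, so only about 39 billion parameters are active for any given token. The "8x22B" name is nominal: the experts share the attention layers, which is why the total is 141B rather than 8 × 22B = 176B. The result is strong performance with cheaper inference than a dense model of comparable size.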
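To make "activates only a subset of experts per token" concrete, here is a minimal sketch of top-k expert routing in plain Python. It is a toy illustration, not Mistral's implementation: the names `top_k_route` and `moe_layer`, the scalar "experts", and the router logits are all made up for the example, and real models route vectors through feed-forward networks inside each layer.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def top_k_route(router_logits, k=2):
    """Pick the k highest-scoring experts and normalize their weights.

    Returns a list of (expert_index, weight) pairs. The weights come from
    a softmax over the selected logits only, so they sum to 1.
    """
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))

def moe_layer(x, router_logits, experts, k=2):
    """Weighted sum of the outputs of the selected experts.

    Experts that are not selected are never called, which is where the
    inference savings come from.
    """
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))

# Eight hypothetical experts; each one just scales its input.
experts = [lambda x, s=s: s * x for s in range(1, 9)]

# Hypothetical router scores for a single token.
logits = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.3, 1.2]

routes = top_k_route(logits, k=2)
output = moe_layer(1.0, logits, experts, k=2)
```

With 8 experts and k=2, only two expert functions run per token, while the other six contribute nothing to the cost of that token.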
The model supports a 64K context window, native function calling, and multilingual generation. It excels at code generation, mathematical reasoning, and tool use, making it well-suited for complex agentic and retrieval-augmented workflows.
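As a rough illustration of what function calling involves on the application side, the sketch below declares one tool and dispatches a tool call to a local Python function. Everything here is an assumption for the example: the `get_weather` tool, the JSON-schema style of `TOOL_SPECS`, and the shape of the model's tool-call output are modeled on common function-calling conventions, not taken from this page or from a specific Mistral API. No model is actually called.

```python
import json

def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"Sunny in {city}"

# Registry mapping tool names to local callables.
TOOLS = {"get_weather": get_weather}

# Tool description that would be sent along with the prompt (assumed format).
TOOL_SPECS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

def dispatch(tool_call_json: str) -> str:
    """Parse a tool call emitted by the model and run the matching function."""
    call = json.loads(tool_call_json)
    func = TOOLS.get(call["name"])
    if func is None:
        raise KeyError(f"unknown tool: {call['name']}")
    return func(**call["arguments"])

# Stand-in for what a model might emit when it decides to call the tool.
model_output = '{"name": "get_weather", "arguments": {"city": "Paris"}}'
result = dispatch(model_output)
```

In an agentic loop, `result` would be appended to the conversation as a tool message so the model can use it in its next turn.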
Benchmarks
MMLU: 77.8