Mixtral 8x22B

by Mistral AI · mistral family

141B parameters

text-generation · code-generation · reasoning · multilingual · math · tool-use

Mixtral 8x22B is Mistral AI's large-scale sparse mixture-of-experts model with 141 billion total parameters, organized as 8 experts of roughly 22 billion parameters each. For every token it routes computation through only 2 of the 8 experts (about 39B active parameters), delivering strong performance with far cheaper inference than a comparably sized dense model. The model supports a 64K context window, native function calling, and multilingual generation. It excels at code generation, mathematical reasoning, and tool use, making it well suited to complex agentic and retrieval-augmented workflows.
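The sparse routing described above can be illustrated with a minimal sketch (plain Python, toy "experts"): a gating network scores all 8 experts for each token, but only the top-2 are actually evaluated; the gate weights and experts here are hypothetical stand-ins.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def moe_forward(token, experts, gate_weights, k=2):
    """Score every expert with a linear gate, but run only the top-k.

    Mixtral-style routing: k=2 of 8 experts per token; the other 6 are
    skipped entirely, which is where the inference savings come from.
    """
    scores = [sum(w * x for w, x in zip(gw, token)) for gw in gate_weights]
    top = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:k]
    probs = softmax([scores[i] for i in top])  # renormalize over chosen experts
    outs = {i: experts[i](token) for i in top}  # only k experts execute
    return [sum(p * outs[i][d] for p, i in zip(probs, top))
            for d in range(len(token))]

# Toy demo: 8 "experts" that just scale the input, and a hypothetical gate.
experts = [lambda t, f=i + 1: [f * x for x in t] for i in range(8)]
gate_weights = [[0.1 * i, 0.05 * i] for i in range(8)]
print(moe_forward([1.0, 2.0], experts, gate_weights))
```

In the real model each expert is a feed-forward block and the gate is learned; the sketch only shows the routing mechanics.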

Quick Start with Ollama

ollama run mixtral:8x22b-q4_K_M
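The model's native function calling can be exercised through Ollama's local chat API, which accepts OpenAI-style tool schemas. A minimal sketch that builds such a request (the `get_weather` tool and its schema are hypothetical examples; sending the request requires a running Ollama server):

```python
import json

def build_tool_call_request(prompt):
    """Build an Ollama /api/chat payload that offers the model one tool."""
    return {
        "model": "mixtral:8x22b",
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }],
    }

payload = build_tool_call_request("What's the weather in Paris?")
print(json.dumps(payload, indent=2))
# Send with e.g.: requests.post("http://localhost:11434/api/chat", json=payload)
```

If the model decides to call the tool, the response message carries a `tool_calls` entry with the function name and arguments instead of plain text.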
Resources: Ollama · Hugging Face · Official Page
Creator: Mistral AI
Parameters: 141B
Architecture: mixture-of-experts
Context: 64K tokens
Released: Apr 17, 2024
License: Apache 2.0
Ollama: mixtral:8x22b

Quantization Options

| Format | File Size | VRAM Required | Ollama Tag |
|---|---|---|---|
| Q4_K_M (recommended) | 80 GB | 86 GB | 8x22b-q4_K_M |
| Q8_0 | 141 GB | 148 GB | 8x22b-q8_0 |
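The file sizes above follow from simple bits-per-weight arithmetic. A sketch, assuming ~4.5 effective bits per weight for Q4_K_M (an approximation here; Q4_K_M mixes 4- and 6-bit blocks) and a flat 8 bits for Q8_0:

```python
def quantized_size_gb(n_params, bits_per_weight):
    """Rough file size: parameters x effective bits per weight, in GB."""
    return n_params * bits_per_weight / 8 / 1e9

# 141B parameters at ~4.5 effective bits/weight lands near the listed 80 GB.
print(quantized_size_gb(141e9, 4.5))   # ~79 GB
# At a flat 8 bits/weight, Q8_0 matches the listed 141 GB.
print(quantized_size_gb(141e9, 8.0))   # 141 GB
```

The VRAM-required column is higher than the file size because the KV cache and activation buffers sit on top of the weights.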

Compatible Hardware

Q4_K_M requires 86 GB VRAM

| Hardware | VRAM | Type | Fit | Est. Speed |
|---|---|---|---|---|
| Mac Studio M4 Ultra 512GB | 512 GB | mac | Runs | ~10 tok/s |
| Mac Pro M2 Ultra 192GB | 192 GB | mac | Runs | ~9 tok/s |
| Mac Studio M4 Ultra 192GB | 192 GB | mac | Runs | ~10 tok/s |
| Mac Studio M4 Max 128GB | 128 GB | mac | Runs | ~6 tok/s |
| MacBook Pro M4 Max 128GB | 128 GB | mac | Runs | ~6 tok/s |
| MacBook Pro M5 Max 128GB | 128 GB | mac | Runs | ~6 tok/s |
| NVIDIA RTX PRO 6000 Blackwell | 96 GB | gpu | Runs (tight) | ~22 tok/s |
| MacBook Pro M3 Max 96GB | 96 GB | mac | Runs (tight) | ~5 tok/s |
| Mac mini M4 Pro 64GB | 64 GB | mac | CPU Offload | ~1 tok/s |
| Mac Studio M4 Max 64GB | 64 GB | mac | CPU Offload | ~2 tok/s |
| MacBook Pro M4 Max 64GB | 64 GB | mac | CPU Offload | ~2 tok/s |
| MacBook Pro M5 Max 64GB | 64 GB | mac | CPU Offload | ~2 tok/s |
95 hardware devices cannot run this model at Q4_K_M.

Benchmark Scores

MMLU: 77.8