Mixtral 8x7B

by Mistral AI · mistral family

47B

parameters

text-generation code-generation reasoning multilingual math creative-writing summarization

Mixtral 8x7B is Mistral AI's mixture-of-experts model, utilizing eight expert networks of 7B parameters each with a routing mechanism that activates two experts per token. This architecture gives it 47B total parameters but only uses about 13B during inference, providing excellent efficiency. The model delivers performance competitive with much larger dense models while maintaining faster inference speeds. It excels at reasoning, multilingual tasks, and code generation, and is particularly well-suited for users who need high-quality output with reasonable hardware requirements.

Quick Start with Ollama

ollama run 8x7b-instruct-v0.1-q4_K_M
Creator Mistral AI
Parameters 47B
Architecture transformer-decoder
Context Length 32K tokens
License Apache 2.0
Released Dec 11, 2023
Ollama mixtral

Quantization Options

Format File Size VRAM Required Quality Ollama Tag
Q4_K_M recommended 22.6 GB 29.7 GB
8x7b-instruct-v0.1-q4_K_M
Q5_K_M 26.3 GB 34.4 GB
8x7b-instruct-v0.1-q5_K_M
Q8_0 44.1 GB 49 GB
8x7b-instruct-v0.1-q8_0

Compatible Hardware for Q4_K_M

Showing compatibility for the recommended quantization (Q4_K_M, 29.7 GB VRAM).

Compatible Hardware

Hardware VRAM Type Fit
Mac Pro M2 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Max 128GB 128 GB mac Runs
MacBook Pro M4 Max 128GB 128 GB mac Runs
Mac Studio M4 Max 64GB 64 GB mac Runs
MacBook Pro M4 Max 64GB 64 GB mac Runs
Mac mini M4 Pro 48GB 48 GB mac Runs
MacBook Pro M4 Max 48GB 48 GB mac Runs
MacBook Pro M4 Pro 48GB 48 GB mac Runs
NVIDIA GeForce RTX 5090 32 GB gpu Runs (tight)
Mac mini M4 32GB 32 GB mac Runs (tight)
AMD Radeon RX 7900 XTX 24 GB gpu CPU Offload
NVIDIA GeForce RTX 3090 24 GB gpu CPU Offload
NVIDIA GeForce RTX 4090 24 GB gpu CPU Offload
Mac mini M4 Pro 24GB 24 GB mac CPU Offload
MacBook Air M4 24GB 24 GB mac CPU Offload
MacBook Pro M4 Pro 24GB 24 GB mac CPU Offload
AMD Radeon RX 7900 XT 20 GB gpu CPU Offload
18 hardware device(s) cannot run this model configuration.

Benchmark Scores

70.6
mmlu