Magistral Small 24B

Name: Magistral Small 24B
Author: Mistral AI

Apache 2.0

Mistral AI · 24B · transformer-decoder

🤗 HuggingFace Ollama Official

2025-06-18 131K context 24B params

Use Cases

chat code reasoning math multilingual

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	17.0 GB	Good	—
Q8_0	8	28.0 GB	Good	—
F16	16	52.0 GB	Excellent	—

About this model

Magistral Small 24B is Mistral AI's reasoning-focused model designed for complex problem-solving with transparent chain-of-thought capabilities. It delivers strong performance on mathematical reasoning, code generation, and multilingual tasks. With its 131K context window and efficient 24B parameter count, Magistral Small strikes a balance between reasoning capability and resource requirements, making it accessible for local deployment on mid-range hardware.

Benchmarks

77.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run magistral:q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 24B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 17.0 GB
Recommended: 17.0 GB
Family: Mistral
Released: 2025-06-18
License: Apache 2.0