Devstral 24B

Name: Devstral 24B
Author: Mistral AI

Apache 2.0

Mistral AI · 24B · transformer-decoder

🤗 HuggingFace Ollama Official

2025-05-21 131K context 24B params

Use Cases

chat code reasoning

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	17.0 GB	Good	—
Q8_0	8	29.0 GB	Excellent	—
F16	16	53.0 GB	Excellent	—

About this model

Devstral 24B is Mistral's dedicated coding agent model, fine-tuned from Mistral Small 3.1 for software engineering tasks. It excels at code generation, repository-scale understanding, debugging, and agentic coding workflows. Ranked #1 among open-source coding agent models at launch. At Q4 it fits on 24 GB GPUs — ideal for developers who want a local alternative to cloud-based coding assistants like GitHub Copilot.

Benchmarks

72.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run devstral:24b-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 24B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 17.0 GB
Recommended: 17.0 GB
Family: Mistral
Released: 2025-05-21
License: Apache 2.0