Qwen 2.5 32B
by Alibaba · qwen-2.5 family
32B parameters
Tags: text-generation · code-generation · reasoning · multilingual · tool-use · math · creative-writing · summarization
Qwen 2.5 32B is a powerful model from Alibaba that delivers excellent performance across reasoning, coding, and multilingual tasks. With 32 billion parameters, it occupies a sweet spot between efficiency and capability that makes it highly popular for local deployment. The model supports 128K context, tool use, and structured output generation. At Q4 quantization, it fits on a single high-end consumer GPU, offering near-70B-class performance at a fraction of the resource cost.
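For example, the structured-output support can be exercised through Ollama's JSON mode. The snippet below is a minimal sketch, not an official example: it assumes the model has already been pulled and is being served by a local Ollama instance on the default port (see the quick start below), and the prompt and expected keys are purely illustrative.

```python
import json
import requests

# Ask for JSON-only output; Ollama's `"format": "json"` constrains generation
# so the returned text parses cleanly. Endpoint and payload follow Ollama's
# REST API; the prompt and expected keys are illustrative, not prescribed.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "qwen2.5:32b",
        "prompt": (
            "Return a JSON object with a key 'strengths' listing three "
            "strengths of running a 32B model locally."
        ),
        "format": "json",
        "stream": False,
    },
    timeout=300,
)
resp.raise_for_status()
structured = json.loads(resp.json()["response"])  # generated text is itself JSON
print(structured)
```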
Quick Start with Ollama
ollama run qwen2.5:32b-instruct-q4_K_M

| Creator | Alibaba |
|---|---|
| Parameters | 32B |
| Architecture | transformer-decoder |
| Context Length | 128K tokens |
| License | Apache 2.0 |
| Released | Sep 19, 2024 |
| Ollama | qwen2.5:32b |
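Beyond the CLI, the same model can be queried programmatically over Ollama's local REST API. The sketch below assumes the server is running on its default port and that the recommended Q4_K_M tag from the table below has already been pulled; otherwise it mirrors the quick-start command above.

```python
import requests

# Chat request against a local Ollama server (default port 11434), using the
# recommended Q4_K_M tag. Non-streaming, so the full reply arrives in one JSON body.
resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "qwen2.5:32b-instruct-q4_K_M",
        "messages": [
            {
                "role": "user",
                "content": "Summarize the trade-offs of running a 32B model locally.",
            }
        ],
        "stream": False,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["message"]["content"])
```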
Quantization Options
| Format | File Size | VRAM Required | Quality | Ollama Tag |
|---|---|---|---|---|
| Q4_K_M (recommended) | 16 GB | 20.7 GB | ★★★★★ | 32b-instruct-q4_K_M |
| Q5_K_M | 18.7 GB | 23.9 GB | ★★★★★ | 32b-instruct-q5_K_M |
| Q8_0 | 28.8 GB | 34 GB | ★★★★★ | 32b-instruct-q8_0 |
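When scripting which tag to pull, the VRAM figures from the table can be encoded directly. The helper below is a small, self-contained sketch (the numbers are copied from the table above; the function name and VRAM-budget argument are illustrative) that picks the largest quantization fitting a given VRAM budget.

```python
from typing import Optional

# Estimated VRAM requirements in GB per Ollama tag, taken from the table above.
QUANT_VRAM_GB = {
    "32b-instruct-q4_K_M": 20.7,
    "32b-instruct-q5_K_M": 23.9,
    "32b-instruct-q8_0": 34.0,
}

def pick_quant(vram_budget_gb: float) -> Optional[str]:
    """Return the most demanding (highest-quality) tag that fits the budget, or None."""
    fitting = [(vram, tag) for tag, vram in QUANT_VRAM_GB.items() if vram <= vram_budget_gb]
    return max(fitting)[1] if fitting else None

# A 24 GB card (e.g. an RTX 4090) fits Q5_K_M but not Q8_0.
print(pick_quant(24.0))  # -> 32b-instruct-q5_K_M
```

Prefix the chosen tag with the repository name when pulling, e.g. `ollama pull qwen2.5:32b-instruct-q5_K_M`.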
Compatible Hardware for Q4_K_M
Showing compatibility for the recommended quantization (Q4_K_M, 20.7 GB VRAM).
9 hardware device(s) cannot run this model configuration.
Benchmark Scores
| Benchmark | Score |
|---|---|
| MMLU | 83.3 |