DeepSeek R1 32B

by DeepSeek · deepseek-r1 family

32B

parameters

text-generation code-generation reasoning math creative-writing

DeepSeek R1 32B is a distilled reasoning model based on the Qwen 2.5 32B architecture, offering strong chain-of-thought reasoning capabilities in a size that fits on high-end consumer hardware. It provides a significant quality uplift over the 14B variant for complex reasoning tasks. This model excels at multi-step mathematical proofs, algorithmic problem solving, and analytical writing. At Q4 quantization it fits on a single 24GB GPU, making it the sweet spot for users who want powerful reasoning without requiring multi-GPU setups.

Quick Start with Ollama

ollama run 32b-q4_K_M
Creator DeepSeek
Parameters 32B
Architecture transformer-decoder
Context Length 128K tokens
License MIT
Released Jan 20, 2025
Ollama deepseek-r1:32b

Quantization Options

Format File Size VRAM Required Quality Ollama Tag
Q4_K_M recommended 16 GB 20.7 GB
32b-q4_K_M
Q5_K_M 18.7 GB 23.9 GB
32b-q5_K_M
Q8_0 28.8 GB 34 GB
32b-q8_0

Compatible Hardware for Q4_K_M

Showing compatibility for the recommended quantization (Q4_K_M, 20.7 GB VRAM).

Compatible Hardware

Hardware VRAM Type Fit
Mac Pro M2 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Max 128GB 128 GB mac Runs
MacBook Pro M4 Max 128GB 128 GB mac Runs
Mac Studio M4 Max 64GB 64 GB mac Runs
MacBook Pro M4 Max 64GB 64 GB mac Runs
Mac mini M4 Pro 48GB 48 GB mac Runs
MacBook Pro M4 Max 48GB 48 GB mac Runs
MacBook Pro M4 Pro 48GB 48 GB mac Runs
NVIDIA GeForce RTX 5090 32 GB gpu Runs
Mac mini M4 32GB 32 GB mac Runs
AMD Radeon RX 7900 XTX 24 GB gpu Runs (tight)
NVIDIA GeForce RTX 3090 24 GB gpu Runs (tight)
NVIDIA GeForce RTX 4090 24 GB gpu Runs (tight)
Mac mini M4 Pro 24GB 24 GB mac Runs (tight)
MacBook Air M4 24GB 24 GB mac Runs (tight)
MacBook Pro M4 Pro 24GB 24 GB mac Runs (tight)
AMD Radeon RX 7900 XT 20 GB gpu CPU Offload
AMD Radeon RX 7800 XT 16 GB gpu CPU Offload
Intel Arc A770 16 GB gpu CPU Offload
NVIDIA GeForce RTX 4060 Ti 16GB 16 GB gpu CPU Offload
NVIDIA GeForce RTX 4070 Ti Super 16 GB gpu CPU Offload
NVIDIA GeForce RTX 4080 16 GB gpu CPU Offload
NVIDIA GeForce RTX 5080 16 GB gpu CPU Offload
Mac mini M4 16GB 16 GB mac CPU Offload
MacBook Air M3 16GB 16 GB mac CPU Offload
MacBook Air M4 16GB 16 GB mac CPU Offload
9 hardware device(s) cannot run this model configuration.

Benchmark Scores

83.2
mmlu