DeepSeek R1 32B

MIT

DeepSeek · 32B · transformer-decoder

2025-01-20 · 131K context · 32B params

Use Cases

chat · code · reasoning · math · writing

Quantization Options

Quant           Bits   VRAM      Quality     Status
Q4_K_M (rec)    4      20.7 GB   Good
Q5_K_M          5      23.9 GB   Good
Q8_0            8      34.0 GB   Excellent

About this model

DeepSeek R1 32B is a distilled reasoning model based on the Qwen 2.5 32B architecture, offering strong chain-of-thought reasoning in a size that fits on high-end consumer hardware. It provides a quality uplift over the 14B variant on complex reasoning tasks, and it is strongest at multi-step mathematical proofs, algorithmic problem solving, and analytical writing. At Q4_K_M quantization it needs about 20.7 GB of VRAM and so fits on a single 24 GB GPU, making it the sweet spot for users who want powerful reasoning without a multi-GPU setup.
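
To make the "fits on a single 24 GB GPU" claim concrete, here is a minimal Python sketch that picks the largest quantization whose listed VRAM figure fits a given card. The VRAM numbers are the ones listed under Quantization Options; the function name best_quant_for and the 1 GB default headroom are illustrative assumptions, not something this page specifies, and real memory use will also vary with context length and runtime.

```python
# VRAM figures (GB) as listed under "Quantization Options" on this page.
QUANT_VRAM_GB = {
    "Q4_K_M": 20.7,
    "Q5_K_M": 23.9,
    "Q8_0": 34.0,
}


def best_quant_for(gpu_vram_gb, headroom_gb=1.0):
    """Return the largest listed quantization that fits, or None.

    headroom_gb is a hypothetical safety margin (assumed, not from the
    page) reserved for the desktop, other processes, and so on.
    """
    budget = gpu_vram_gb - headroom_gb
    # Keep (vram, name) pairs so max() selects the largest footprint that fits.
    fitting = [(vram, name) for name, vram in QUANT_VRAM_GB.items() if vram <= budget]
    if not fitting:
        return None
    return max(fitting)[1]


if __name__ == "__main__":
    for vram in (16, 24, 48):
        print(f"{vram} GB GPU -> {best_quant_for(vram)}")
```

With the assumed 1 GB headroom, a 24 GB card resolves to Q4_K_M, matching the recommendation above. With no headroom at all, Q5_K_M (23.9 GB) also nominally fits, but that leaves almost no margin.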

Benchmarks

MMLU: 83.2