Skip to content

DeepSeek R1 14B

MIT

DeepSeek · 14B · transformer-decoder

2025-01-20 131K context 14B params

Use Cases

chat code reasoning math

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec49.9 GBGood
Q5_K_M511.3 GBGood
Q8_0816.0 GBExcellent

About this model

DeepSeek R1 14B is a distilled reasoning model based on the Qwen 2.5 14B architecture, trained to replicate the chain-of-thought reasoning capabilities of the full DeepSeek R1 671B model. It offers a substantial reasoning improvement over the 7B variant. This model is particularly strong at mathematical reasoning, competitive programming problems, and scientific analysis. It fits comfortably on GPUs with 16GB VRAM at Q4 quantization, making it one of the most accessible dedicated reasoning models available.

Benchmarks

79.7
mmlu