DeepSeek R1 14B

Name: DeepSeek R1 14B
Author: DeepSeek

MIT

DeepSeek · 14B · transformer-decoder

🤗 HuggingFace Ollama Official Paper

2025-01-20 131K context 14B params

Use Cases

chat code reasoning math

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	9.9 GB	Good	—
Q5_K_M	5	11.3 GB	Good	—
Q8_0	8	16.0 GB	Excellent	—

About this model

DeepSeek R1 14B is a distilled reasoning model based on the Qwen 2.5 14B architecture, trained to replicate the chain-of-thought reasoning capabilities of the full DeepSeek R1 671B model. It offers a substantial reasoning improvement over the 7B variant. This model is particularly strong at mathematical reasoning, competitive programming problems, and scientific analysis. It fits comfortably on GPUs with 16GB VRAM at Q4 quantization, making it one of the most accessible dedicated reasoning models available.

Benchmarks

79.7

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run deepseek-r1:14b-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 14B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 9.9 GB
Recommended: 9.9 GB
Family: DeepSeek R1
Released: 2025-01-20
License: MIT