DeepSeek R1 70B

Name: DeepSeek R1 70B
Author: DeepSeek

License: MIT

DeepSeek · 70B · transformer-decoder

Released 2025-01-20 · 131K context · 70B params

Use Cases

chat, code, reasoning, math, writing, summary

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M (recommended)	4	43.5 GB	Good
Q5_K_M	5	50.5 GB	Good
Q8_0	8	72.0 GB	Excellent
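
As a rough illustration of how the table can be used, the sketch below picks the largest quantization whose listed VRAM figure fits a given budget. The function name `pick_quant` and the `headroom_gb` parameter are hypothetical helpers introduced here; only the VRAM numbers come from the table.

```python
# VRAM figures (GB) copied from the quantization table above.
QUANT_VRAM_GB = {"Q4_K_M": 43.5, "Q5_K_M": 50.5, "Q8_0": 72.0}


def pick_quant(available_vram_gb, headroom_gb=0.0):
    """Return the quant with the largest listed VRAM that still fits, or None.

    headroom_gb is an assumed safety margin (hypothetical parameter) to
    reserve for anything the table's figure may not cover.
    """
    budget = available_vram_gb - headroom_gb
    fitting = [(vram, name) for name, vram in QUANT_VRAM_GB.items() if vram <= budget]
    # In this table, a larger VRAM figure corresponds to a higher-bit quant.
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for vram in (24, 48, 80):
        print(vram, "GB ->", pick_quant(vram))
```

With these figures, a single 24 GB card fits none of the listed options, which is consistent with the multi-GPU recommendation.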

About this model

DeepSeek R1 70B is the largest distilled reasoning model in the DeepSeek R1 series, based on the Llama 3.3 70B architecture. Of the distilled variants, it retains the most reasoning capability from the full 671B-parameter DeepSeek R1 model, approaching that model's reasoning quality on many benchmarks while requiring far less compute. It is best suited to advanced mathematics, competitive programming, scientific reasoning, and complex analytical tasks. Because even the smallest quantization listed here needs 43.5 GB of VRAM, multi-GPU setups are recommended for comfortable inference.

Benchmarks

MMLU: 85.5

Install

Ollama

ollama run deepseek-r1:70b-q4_K_M
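
Once the model has been pulled, it can also be queried programmatically. The sketch below assumes a local Ollama server on its default port (11434) and uses the `/api/generate` endpoint with streaming disabled. The model tag is copied from the command above; `build_request` and `generate` are hypothetical helper names, not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Model tag taken from the `ollama run` command above.
MODEL_TAG = "deepseek-r1:70b-q4_K_M"


def build_request(prompt, model=MODEL_TAG, url=OLLAMA_URL):
    """Build a non-streaming generate request for the Ollama HTTP API."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt):
    """Send the prompt and return the generated text from the JSON reply."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("What is 17 * 23?")` returns the model's reply as a single string. Reasoning models can take a long time to answer, so a real client would likely want a timeout and streaming.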

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 70B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 43.5 GB
Recommended: 43.5 GB
Family: DeepSeek R1
Released: 2025-01-20
License: MIT
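
The page does not say whether the VRAM figures above include the key-value cache needed for long prompts. As a back-of-the-envelope sketch, the code below estimates KV-cache size from context length. The layer count, KV-head count, and head dimension are assumptions taken from the publicly documented Llama 3 70B configuration, not from this page, and the cache is assumed to be stored in 16-bit precision.

```python
# Assumed Llama 3 70B attention shape (not stated on this page).
NUM_LAYERS = 80      # transformer blocks
NUM_KV_HEADS = 8     # grouped-query attention: 8 key/value heads
HEAD_DIM = 128       # dimension per attention head


def kv_cache_bytes(num_tokens, bytes_per_value=2):
    """Bytes needed to cache keys and values for num_tokens tokens."""
    # Factor of 2: one tensor for keys, one for values, in every layer.
    per_token = 2 * NUM_LAYERS * NUM_KV_HEADS * HEAD_DIM * bytes_per_value
    return per_token * num_tokens


def kv_cache_gib(num_tokens, bytes_per_value=2):
    """Same estimate expressed in GiB."""
    return kv_cache_bytes(num_tokens, bytes_per_value) / 2**30


if __name__ == "__main__":
    for tokens in (8_192, 32_768, 131_072):
        print(f"{tokens:>7} tokens -> {kv_cache_gib(tokens):.1f} GiB")
```

Under these assumptions, filling the full 131K context (taken here as 131,072 tokens) would need about 40 GiB of cache on top of the weights, so practical deployments often run with a much shorter context window.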