DeepSeek R1 70B

by DeepSeek · deepseek-r1 family

70B

parameters

text-generation code-generation reasoning math creative-writing summarization

DeepSeek R1 70B is the largest distilled reasoning model in the DeepSeek R1 series, based on the Llama 3.3 70B architecture. It captures the most reasoning capability from the full DeepSeek R1 671B model through distillation, delivering exceptional performance on complex reasoning tasks. This model approaches the reasoning quality of the full R1 model on many benchmarks while requiring far less compute. It excels at advanced mathematics, competitive programming, scientific reasoning, and complex analytical tasks. Multi-GPU setups are recommended for comfortable inference.

Quick Start with Ollama

ollama run 70b-q4_K_M
Creator DeepSeek
Parameters 70B
Architecture transformer-decoder
Context Length 128K tokens
License MIT
Released Jan 20, 2025
Ollama deepseek-r1:70b

Quantization Options

Format File Size VRAM Required Quality Ollama Tag
Q4_K_M recommended 34.9 GB 43.5 GB
70b-q4_K_M
Q5_K_M 40.8 GB 50.5 GB
70b-q5_K_M
Q8_0 64.8 GB 72 GB
70b-q8_0

Compatible Hardware for Q4_K_M

Showing compatibility for the recommended quantization (Q4_K_M, 43.5 GB VRAM).

Compatible Hardware

Hardware VRAM Type Fit
Mac Pro M2 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Ultra 192GB 192 GB mac Runs
Mac Studio M4 Max 128GB 128 GB mac Runs
MacBook Pro M4 Max 128GB 128 GB mac Runs
Mac Studio M4 Max 64GB 64 GB mac Runs
MacBook Pro M4 Max 64GB 64 GB mac Runs
Mac mini M4 Pro 48GB 48 GB mac Runs (tight)
MacBook Pro M4 Max 48GB 48 GB mac Runs (tight)
MacBook Pro M4 Pro 48GB 48 GB mac Runs (tight)
NVIDIA GeForce RTX 5090 32 GB gpu CPU Offload
Mac mini M4 32GB 32 GB mac CPU Offload
25 hardware device(s) cannot run this model configuration.

Benchmark Scores

85.5
mmlu