DeepSeek R1 671B

Name: DeepSeek R1 671B
Author: DeepSeek

License: MIT

DeepSeek · 671B · mixture-of-experts

Released 2025-01-20 · 131K context · 671B params

Use Cases

chat, code, reasoning, math, writing, summary

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M (recommended)	4	362.0 GB	Good
Q8_0	8	685.0 GB	Excellent
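
The VRAM figures above can be sanity-checked with simple arithmetic: raw weight storage is roughly parameters × bits per weight ÷ 8. The sketch below is a lower-bound estimate only. The function name `weight_memory_gb` is illustrative, the bit widths are the nominal values from the table, and this page does not say how its VRAM figures were derived, so the gap between the two numbers is presumably quantization metadata, KV cache, and runtime buffers.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Raw weight storage in decimal GB: parameters x bits / 8 bits per byte."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# Values copied from the table: quant name -> (nominal bits, listed VRAM in GB).
LISTED = {"Q4_K_M": (4, 362.0), "Q8_0": (8, 685.0)}

for quant, (bits, listed_gb) in LISTED.items():
    raw_gb = weight_memory_gb(671, bits)
    print(
        f"{quant}: raw weights ~{raw_gb:.1f} GB, listed {listed_gb:.1f} GB, "
        f"difference {listed_gb - raw_gb:.1f} GB"
    )
```

Under these assumptions the raw weights alone come to about 335.5 GB at 4 bits and 671 GB at 8 bits, so the listed requirements sit above the weight-only floor in both cases.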

About this model

DeepSeek R1 671B is the full-scale reasoning model from DeepSeek, trained with reinforcement learning to produce long chain-of-thought reasoning. Unlike the distilled variants, this is the original teacher model: a 671B-parameter mixture-of-experts network that delivers state-of-the-art reasoning performance. It excels at complex mathematical proofs, multi-step logical reasoning, and challenging coding problems, and it writes out its reasoning explicitly before giving a final answer. Its MIT license makes it one of the most permissively licensed frontier-class models available.
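
Because the model writes out its reasoning before the answer, applications often want to separate the two. The sketch below assumes the raw output wraps the reasoning in `<think>...</think>` tags, which is how R1 output commonly appears. Some runtimes strip or relocate these tags, so treat the helper `split_reasoning` as a hypothetical starting point rather than a guaranteed format.

```python
import re

# Matches one <think>...</think> block, including newlines inside it.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from raw model output."""
    # Collect the contents of every think block, trimmed of whitespace.
    reasoning = "\n".join(m.strip() for m in THINK_BLOCK.findall(text))
    # Whatever remains after removing the think blocks is the answer.
    answer = THINK_BLOCK.sub("", text).strip()
    return reasoning, answer


sample = "<think>2 + 2 is 4, times 3 is 12.</think>\nThe result is 12."
reasoning, answer = split_reasoning(sample)
print(answer)  # prints "The result is 12."
```

Text without any tags passes through unchanged as the answer, with an empty reasoning string.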

Benchmarks

MMLU: 90.8

Install

Ollama

ollama run deepseek-r1:671b-q4_K_M
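
Once the model is pulled, it can also be queried programmatically. The sketch below assumes an Ollama server is running locally on its default port (11434) and uses the `/api/generate` endpoint with streaming disabled. The helper names `build_request` and `generate` are illustrative, and the model tag is simply the one from the command above.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local endpoint
MODEL_TAG = "deepseek-r1:671b-q4_K_M"  # tag taken from the ollama command above


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `generate("Prove that the sum of two even integers is even.")` would return the model's full reply as a single string. Long reasoning chains can take a while, so a streaming request may be a better fit for interactive use.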

llama.cpp / GGUF

Download the GGUF files from HuggingFace, then load them with llama.cpp.

Specs

Parameters: 671B
Architecture: mixture-of-experts
Context: 131K tokens
Min VRAM: 362.0 GB
Recommended VRAM: 362.0 GB
Family: DeepSeek R1
Released: 2025-01-20
License: MIT