Skip to content

DeepSeek R1 671B

MIT

DeepSeek · 671B · mixture-of-experts

2025-01-20 131K context 671B params

Use Cases

chat code reasoning math writing summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec4362.0 GBGood
Q8_08685.0 GBExcellent

About this model

DeepSeek R1 671B is the full-scale reasoning model from DeepSeek, trained with reinforcement learning to perform deep chain-of-thought reasoning. Unlike the distilled variants, this is the original teacher model with 671B mixture-of-experts parameters, delivering state-of-the-art reasoning performance. The model excels at complex mathematical proofs, multi-step logical reasoning, and challenging coding problems by explicitly showing its reasoning process. Its MIT license makes it one of the most permissively licensed frontier-class models available.

Benchmarks

90.8
mmlu