DeepSeek V3

Name: DeepSeek V3
Author: DeepSeek

DeepSeek License

DeepSeek · 671B · mixture-of-experts

🤗 HuggingFace Ollama Official

2024-12-26 131K context 671B params

Use Cases

chat code reasoning multilingual math tools writing summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	362.0 GB	Good	—
Q8_0	8	685.0 GB	Excellent	—

About this model

DeepSeek V3 is a 671B parameter mixture-of-experts model that rivals top proprietary models across coding, math, and general reasoning benchmarks. It uses an efficient MoE architecture that activates only a fraction of its parameters per token, but all 671B weights must be loaded into VRAM. The model demonstrates particularly strong performance on coding and mathematical tasks, making it a compelling open-weight alternative to GPT-4 class models for users with sufficient hardware resources.

Benchmarks

88.5

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run deepseek-v3:q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 671B
Architecture: mixture-of-experts
Context: 131K tokens
Min VRAM: 362.0 GB
Recommended: 362.0 GB
Family: DeepSeek V3
Released: 2024-12-26
License: DeepSeek License