Skip to content

Which Models Can You Run?

Every model in the catalog, ranked for your hardware. We auto-detect what you're on; pick manually if it's wrong.

Detecting your hardware…
Sort
Grades
105 of 105 models
ModelParamsMin VRAMStatus
Aya Expanse 32B
Cohere · Aya Expanse
32B22.0 GB
Q4_K_M
View →
Aya Expanse 8B
Cohere · Aya Expanse
8B6.5 GB
Q4_K_M
View →
Codestral 22B
Mistral AI · Mistral
22B14.7 GB
Q4_K_M
View →
Codestral Mamba 7B
Mistral AI · Mistral
7B6.9 GB
Q4_K_M
View →
Cogito 70B
Deep Cogito · Cogito
70B43.0 GB
Q4_K_M
View →
Cogito 32B
Deep Cogito · Cogito
32B21.5 GB
Q4_K_M
View →
Cogito 8B
Deep Cogito · Cogito
8B7.5 GB
Q4_K_M
View →
Command A 111B
Cohere · Command R
111B61.0 GB
Q4_K_M
View →
Command R 35B
Cohere · Command R
35B22.5 GB
Q4_K_M
View →
Command R+ 104B
Cohere · Command R
104B57.0 GB
Q4_K_M
View →
DeepSeek R1 1.5B
DeepSeek · DeepSeek R1
1.5B3.0 GB
Q8_0
View →
DeepSeek R1 14B
DeepSeek · DeepSeek R1
14B9.9 GB
Q4_K_M
View →
DeepSeek R1 32B
DeepSeek · DeepSeek R1
32B20.7 GB
Q4_K_M
View →
DeepSeek R1 671B
DeepSeek · DeepSeek R1
671B362.0 GB
Q4_K_M
View →
DeepSeek R1 7B
DeepSeek · DeepSeek R1
7B9.0 GB
Q8_0
View →
DeepSeek R1 8B
DeepSeek · DeepSeek R1
8B7.5 GB
Q4_K_M
View →
DeepSeek V3-0324
DeepSeek · DeepSeek V3
671B362.0 GB
Q4_K_M
View →
DeepSeek V3.2
DeepSeek · DeepSeek V3
671B420.0 GB
Q4_K_M
View →
DeepSeek R1 70B
DeepSeek · DeepSeek R1
70B43.5 GB
Q4_K_M
View →
DeepSeek V3
DeepSeek · DeepSeek V3
671B362.0 GB
Q4_K_M
View →
Devstral 2 123B
Mistral AI · Mistral
123B67.0 GB
Q4_K_M
View →
Devstral 24B
Mistral AI · Mistral
24B17.0 GB
Q4_K_M
View →
Dolphin Mixtral 8x7B
Cognitive Computations · Dolphin
47B26.0 GB
Q4_K_M
View →
Dolphin 3 8B
Cognitive Computations · Dolphin
8B6.0 GB
Q4_K_M
View →
Falcon 3 10B
TII · Falcon 3
10B8.5 GB
Q4_K_M
View →
Falcon 3 7B
TII · Falcon 3
7B6.8 GB
Q4_K_M
View →
Gemma 2 27B
Google · Gemma 2
27B17.7 GB
Q4_K_M
View →
Gemma 2 2B
Google · Gemma 2
2B4.0 GB
Q8_0
View →
Gemma 2 9B
Google · Gemma 2
9B11.0 GB
Q8_0
View →
Gemma 3 12B
Google · Gemma 3
12B10.5 GB
Q4_K_M
View →
Gemma 3 1B
Google · Gemma 3
1B2.0 GB
Q8_0
View →
Gemma 3 27B
Google · Gemma 3
27B20.0 GB
Q4_K_M
View →
Gemma 3 4B
Google · Gemma 3
4B5.0 GB
Q4_K_M
View →
Gemma 3n E2B
Google · Gemma 3
2B3.3 GB
Q4_K_M
View →
Gemma 3n E4B
Google · Gemma 3
4B4.5 GB
Q4_K_M
View →
Gemma 4 26B
Google · Gemma 4
26B20.0 GB
Q4_K_M
View →
Gemma 4 E2B
Google · Gemma 4
2B4.0 GB
Q4_K_M
View →
Gemma 4 31B
Google · Gemma 4
31B22.0 GB
Q4_K_M
View →
Gemma 4 E4B
Google · Gemma 4
4B6.0 GB
Q4_K_M
View →
GLM-5.1
Zhipu AI · GLM
754B305.0 GB
Q2_K
View →
GLM-5
Zhipu AI · GLM
744B300.0 GB
Q2_K
View →
Granite 3.3 8B
IBM · Granite
8B10.0 GB
Q8_0
View →
InternLM 2.5 20B
Shanghai AI Lab · InternLM 2.5
20B12.0 GB
Q4_K_M
View →
InternLM 2.5 7B
Shanghai AI Lab · InternLM 2.5
7B5.5 GB
Q4_K_M
View →
Kimi K2.5
Moonshot AI · Kimi
1040B390.0 GB
Q2_K
View →
Llama 3.1 405B
Meta · Llama 3
405B244.5 GB
Q4_K_M
View →
Llama 3.1 70B
Meta · Llama 3
70B43.5 GB
Q4_K_M
View →
Llama 3.1 8B
Meta · Llama 3
8B10.0 GB
Q8_0
View →
Llama 3.2 1B
Meta · Llama 3
1B3.0 GB
Q8_0
View →
Llama 3.2 3B
Meta · Llama 3
3B5.0 GB
Q8_0
View →
Llama 3.2 Vision 11B
Meta · Llama 3
11B8.5 GB
Q4_K_M
View →
Llama 3.2 Vision 90B
Meta · Llama 3
90B50.0 GB
Q4_K_M
View →
Llama 3.3 70B
Meta · Llama 3
70B43.5 GB
Q4_K_M
View →
Llama 4 Maverick
Meta · Llama 4
400B228.0 GB
Q4_K_M
View →
Llama 4 Scout (109B/17B active)
Meta · Llama 4
109B72.0 GB
Q4_K_M
View →
Magistral Small 24B
Mistral AI · Mistral
24B17.0 GB
Q4_K_M
View →
Mistral 7B
Mistral AI · Mistral
7B9.0 GB
Q8_0
View →
Mistral Large 2 123B
Mistral AI · Mistral
123B67.0 GB
Q4_K_M
View →
Mistral Nemo 12B
Mistral AI · Mistral
12B9.5 GB
Q4_K_M
View →
Mistral Small 3.1 24B
Mistral AI · Mistral
24B18.0 GB
Q4_K_M
View →
Mixtral 8x22B
Mistral AI · Mistral
141B86.0 GB
Q4_K_M
View →
Mixtral 8x7B
Mistral AI · Mistral
47B29.7 GB
Q4_K_M
View →
Nemotron 3 Nano 8B
NVIDIA · Nemotron
8B7.5 GB
Q4_K_M
View →
Nemotron Ultra 253B
NVIDIA · Nemotron
253B155.0 GB
Q4_K_M
View →
Nous Hermes 2 34B
Nous Research · Nous Hermes
34B19.0 GB
Q4_K_M
View →
Nous Hermes 2 8B
Nous Research · Nous Hermes
8B6.0 GB
Q4_K_M
View →
OpenChat 3.5 7B
OpenChat · OpenChat
7B6.9 GB
Q4_K_M
View →
Phi-3 Mini 3.8B
Microsoft · Phi
3.8B5.8 GB
Q8_0
View →
Phi-4 14B
Microsoft · Phi
14B9.9 GB
Q4_K_M
View →
Phi-4 Mini 3.8B
Microsoft · Phi
3.8B4.5 GB
Q4_K_M
View →
Phi-4 Reasoning 14B
Microsoft · Phi
14B11.0 GB
Q4_K_M
View →
Qwen 2.5 14B
Alibaba · Qwen 2.5
14B9.9 GB
Q4_K_M
View →
Qwen 2.5 32B
Alibaba · Qwen 2.5
32B20.7 GB
Q4_K_M
View →
Qwen 2.5 72B
Alibaba · Qwen 2.5
72B44.7 GB
Q4_K_M
View →
Qwen 2.5 Coder 14B
Alibaba · Qwen 2.5
14B12.0 GB
Q4_K_M
View →
Qwen 2.5 7B
Alibaba · Qwen 2.5
7B9.0 GB
Q8_0
View →
Qwen 2.5 Coder 32B
Alibaba · Qwen 2.5
32B23.0 GB
Q4_K_M
View →
Qwen 2.5 Coder 7B
Alibaba · Qwen 2.5
7B9.0 GB
Q8_0
View →
Qwen 2.5 VL 72B
Alibaba · Qwen 2.5
72B41.0 GB
Q4_K_M
View →
Qwen 2.5 VL 7B
Alibaba · Qwen 2.5
7B7.0 GB
Q4_K_M
View →
Qwen 3 0.6B
Alibaba · Qwen 3
0.6B2.5 GB
Q4_K_M
View →
Qwen 3 235B-A22B
Alibaba · Qwen 3
235B138.0 GB
Q4_K_M
View →
Qwen 3 14B
Alibaba · Qwen 3
14B12.0 GB
Q4_K_M
View →
Qwen 3 30B-A3B (MoE)
Alibaba · Qwen 3
30B22.0 GB
Q4_K_M
View →
Qwen 3 32B
Alibaba · Qwen 3
32B23.0 GB
Q4_K_M
View →
Qwen 3 4B
Alibaba · Qwen 3
4B4.5 GB
Q4_K_M
View →
Qwen 3 8B
Alibaba · Qwen 3
8B7.5 GB
Q4_K_M
View →
Qwen 3.5 0.8B
Alibaba · Qwen 3.5
0.8B1.5 GB
Q4_K_M
View →
Qwen 3.5 122B
Alibaba · Qwen 3.5
122B85.0 GB
Q4_K_M
View →
Qwen 3.5 27B
Alibaba · Qwen 3.5
27B19.0 GB
Q4_K_M
View →
Qwen 3.5 35B A3B
Alibaba · Qwen 3.5
35B12.0 GB
Q4_K_M
View →
Qwen 3.5 2B
Alibaba · Qwen 3.5
2B3.0 GB
Q4_K_M
View →
Qwen 3.5 4B
Alibaba · Qwen 3.5
4B4.5 GB
Q4_K_M
View →
Qwen 3.5 9B
Alibaba · Qwen 3.5
9B7.5 GB
Q4_K_M
View →
SmolLM2 1.7B
Hugging Face · SmolLM
1.7B2.7 GB
Q8_0
View →
QwQ 32B
Alibaba · Qwen 3
32B21.5 GB
Q4_K_M
View →
StarCoder2 15B
BigCode · StarCoder
15B17.0 GB
Q8_0
View →
StarCoder2 3B
BigCode · StarCoder
3B3.5 GB
Q4_K_M
View →
StarCoder2 7B
BigCode · StarCoder
7B5.5 GB
Q4_K_M
View →
WizardCoder 33B
Microsoft · WizardLM
33B22.0 GB
Q4_K_M
View →
WizardLM 2 7B
Microsoft · WizardLM
7B6.9 GB
Q4_K_M
View →
Yi 1.5 34B
01.AI · Yi 1.5
34B21.0 GB
Q4_K_M
View →
Yi 1.5 6B
01.AI · Yi 1.5
6B5.0 GB
Q4_K_M
View →
Yi 1.5 9B
01.AI · Yi 1.5
9B6.5 GB
Q4_K_M
View →
Yi Coder 9B
01.AI · Yi Coder
9B8.0 GB
Q4_K_M
View →
S · Runs great

Fast inference with plenty of VRAM headroom. Score 85+.

A · Runs well

Solid speed and headroom. Score 70–84.

B · Decent

Usable speed, moderate headroom. Score 55–69.

C · Tight fit

Limited headroom or sluggish speed. Score 40–54.

D · Barely runs

Slow tokens, very tight on VRAM. Score 20–39.

F · Too heavy

Won't load well or at all. Score 0–19.