Qwen 3 235B-A22B

Name: Qwen 3 235B-A22B
Author: Alibaba

Apache 2.0

Alibaba · 235B · mixture-of-experts

🤗 HuggingFace Ollama Official

2025-04-29 131K context 235B params

Use Cases

chat code reasoning multilingual math tools writing summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	138.0 GB	Good	—
Q8_0	8	245.0 GB	Good	—

About this model

Qwen 3 235B-A22B is Alibaba's largest and most capable model, using a mixture-of-experts architecture with 235 billion total parameters and 22 billion active parameters per token. This design delivers frontier-level performance while keeping per-token compute costs manageable. The model features a 128K context window and excels across code generation, mathematical reasoning, multilingual tasks, tool use, and creative writing. It represents the top of the Qwen 3 lineup, competing with the strongest open-weight models available.

Benchmarks

87.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run qwen3:235b-a22b-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 235B
Architecture: mixture-of-experts
Context: 131K tokens
Min VRAM: 138.0 GB
Recommended: 138.0 GB
Family: Qwen 3
Released: 2025-04-29
License: Apache 2.0