
Qwen 3 235B-A22B

by Alibaba · qwen-3 family

235B parameters

text-generation code-generation reasoning multilingual math tool-use creative-writing summarization

Qwen 3 235B-A22B is the largest and most capable model in Alibaba's Qwen 3 family. It uses a mixture-of-experts architecture with 235 billion total parameters, of which 22 billion are active per token, so per-token compute scales with the 22B active parameters rather than the full 235B. The model has a 128K-token context window and is aimed at code generation, mathematical reasoning, multilingual tasks, tool use, and creative writing. It sits at the top of the Qwen 3 lineup and competes with the strongest open-weight models available.
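
The total-versus-active split is easier to see with a little arithmetic. The sketch below uses only the two parameter counts quoted above. The figure of roughly 2 FLOPs per active parameter per generated token is a common rule of thumb assumed here for illustration, not a number published for this model.

```python
# Parameter counts quoted in the description above.
TOTAL_PARAMS = 235e9
ACTIVE_PARAMS = 22e9


def active_fraction(total: float, active: float) -> float:
    """Share of the model's weights used for any single token."""
    return active / total


def approx_flops_per_token(active: float) -> float:
    """Rough forward-pass cost; ASSUMES ~2 FLOPs per active parameter."""
    return 2.0 * active


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"active share of weights: {frac:.1%}")
    print(f"approx. FLOPs per token: {approx_flops_per_token(ACTIVE_PARAMS):.2e}")
```

Only the compute shrinks with the active count. All 235B weights still have to be resident in memory, which is why the memory figures on this page track the total parameter count.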

Quick Start with Ollama

ollama run qwen3:235b-a22b-q4_K_M
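
Once the model is pulled, it can also be queried programmatically. The following is a minimal sketch, assuming an Ollama server is listening on its default local address (`http://localhost:11434`) and that the model is available under the tag `qwen3:235b-a22b` listed on this page. The helper names `build_request` and `generate` are illustrative, not part of any library.

```python
import json
import urllib.request

# ASSUMPTIONS: default local Ollama address, and the model tag listed on this page.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_TAG = "qwen3:235b-a22b"


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Summarize mixture-of-experts in one sentence.")` should return the model's reply as a string. Expect the first call to be slow while the weights are loaded.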

Resources: Ollama, Hugging Face, Official Page
Creator: Alibaba
Parameters: 235B
Architecture: mixture-of-experts
Context: 128K tokens
Released: Apr 29, 2025
License: Apache 2.0
Ollama: qwen3:235b-a22b

Quantization Options

Format                 File Size   VRAM Required   Ollama Tag
Q4_K_M (recommended)   130 GB      138 GB          235b-a22b-q4_K_M
Q8_0                   235 GB      245 GB          235b-a22b-q8_0
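
As a rough sanity check on these figures, the file sizes can be converted into average bits stored per weight, and the gap between file size and required VRAM read as runtime overhead. This sketch assumes that GB means 10^9 bytes and that the overhead covers items such as the KV cache and buffers. The page states neither.

```python
# Figures from the quantization table; GB is ASSUMED to mean 10**9 bytes.
PARAMS = 235e9
QUANTS = {
    "Q4_K_M": {"file_gb": 130, "vram_gb": 138},
    "Q8_0": {"file_gb": 235, "vram_gb": 245},
}


def bits_per_weight(file_gb: float, params: float = PARAMS) -> float:
    """Average bits stored per parameter, ignoring file metadata."""
    return file_gb * 1e9 * 8 / params


def runtime_overhead_gb(file_gb: float, vram_gb: float) -> float:
    """Memory listed beyond the weights themselves (assumed: KV cache, buffers)."""
    return vram_gb - file_gb


for name, q in QUANTS.items():
    bpw = bits_per_weight(q["file_gb"])
    extra = runtime_overhead_gb(q["file_gb"], q["vram_gb"])
    print(f"{name}: {bpw:.2f} bits/weight, {extra} GB overhead")
```

The listed file sizes are rounded, so the derived bits-per-weight values are approximate.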

Compatible Hardware

Q4_K_M requires 138 GB VRAM


Hardware                        VRAM     Type   Fit           Est. Speed
Mac Studio M4 Ultra 512GB       512 GB   mac    Runs          ~6 tok/s
Mac Pro M2 Ultra 192GB          192 GB   mac    Runs          ~6 tok/s
Mac Studio M4 Ultra 192GB       192 GB   mac    Runs          ~6 tok/s
Mac Studio M4 Max 128GB         128 GB   mac    CPU Offload   ~1 tok/s
MacBook Pro M4 Max 128GB        128 GB   mac    CPU Offload   ~1 tok/s
MacBook Pro M5 Max 128GB        128 GB   mac    CPU Offload   ~1 tok/s
NVIDIA RTX PRO 6000 Blackwell   96 GB    gpu    CPU Offload   ~4 tok/s
MacBook Pro M3 Max 96GB         96 GB    mac    CPU Offload   ~1 tok/s

99 hardware devices cannot run this model at Q4_K_M.
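
The Fit column for the listed devices is consistent with a simple comparison of device memory against the 138 GB requirement. The sketch below reproduces that comparison. The rule itself is a guess inferred from the rows shown, the `Device` class and `classify_fit` function are hypothetical, and the cutoff below which a device is counted as unable to run the model at all is not stated on the page, so it is not modeled.

```python
from dataclasses import dataclass

REQUIRED_GB = 138  # Q4_K_M requirement stated on this page


@dataclass
class Device:
    name: str
    memory_gb: int


def classify_fit(device: Device, required_gb: int = REQUIRED_GB) -> str:
    """HYPOTHETICAL rule: full fit if memory covers the requirement, else offload."""
    return "Runs" if device.memory_gb >= required_gb else "CPU Offload"


# A few rows copied from the hardware table.
DEVICES = [
    Device("Mac Studio M4 Ultra 512GB", 512),
    Device("Mac Pro M2 Ultra 192GB", 192),
    Device("MacBook Pro M4 Max 128GB", 128),
    Device("NVIDIA RTX PRO 6000 Blackwell", 96),
]

for d in DEVICES:
    print(f"{d.name}: {classify_fit(d)}")
```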

Benchmark Scores

MMLU: 87.0