Skip to content

Qwen 3 235B-A22B

Apache 2.0

Alibaba · 235B · mixture-of-experts

2025-04-29 131K context 235B params

Use Cases

chat code reasoning multilingual math tools writing summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec4138.0 GBGood
Q8_08245.0 GBGood

About this model

Qwen 3 235B-A22B is Alibaba's largest and most capable model, using a mixture-of-experts architecture with 235 billion total parameters and 22 billion active parameters per token. This design delivers frontier-level performance while keeping per-token compute costs manageable. The model features a 128K context window and excels across code generation, mathematical reasoning, multilingual tasks, tool use, and creative writing. It represents the top of the Qwen 3 lineup, competing with the strongest open-weight models available.

Benchmarks

87.0
mmlu