Skip to content

GLM-5

MIT

Zhipu AI · 744B · transformer-decoder

2026-03-15 200K context 744B params

Use Cases

chat code reasoning multilingual tools math

Quantization Options

QuantBitsVRAMQualityStatus
Q2_Krec2300.0 GBModerate

About this model

GLM-5 is Zhipu AI's flagship reasoning model — a 744B parameter Mixture-of-Experts with 40B active parameters per token. It achieves state-of-the-art performance on reasoning and agentic benchmarks, competing with the best closed-source models. At 281 GB even at aggressive 2-bit quantization, GLM-5 requires enterprise-grade hardware — multiple high-VRAM GPUs or a Mac Studio/Pro with 300 GB+ unified memory. Not practical for consumer hardware, but available through Ollama for those with the resources.