Qwen 2.5 VL 72B

Name: Qwen 2.5 VL 72B
Author: Alibaba

Apache 2.0

Alibaba · 72B · transformer-decoder

2025-01-26 · 33K context · 72B params

Use Cases

chat, vision, reasoning, multilingual, math

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_M (recommended)	4	41.0 GB	Good	—
Q8_0	8	78.0 GB	Good	—
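
As a rough sanity check on the VRAM column, the weights alone take about parameters × bits ÷ 8 bytes. The sketch below applies that arithmetic to the nominal bit widths in the table. It is an approximation under stated assumptions: real GGUF quantizations use somewhat more than their nominal bits per weight, and the gap between the weight-only figure and the listed VRAM is assumed here to cover runtime overhead. The page does not say how its figures were derived.

```python
# Weight-only memory estimate: parameters x bits per weight / 8 bytes.
# Assumption: nominal bit widths from the table; actual GGUF files differ slightly.
PARAMS = 72e9  # 72B parameters, as listed on this page


def weight_gb(params: float, bits: int) -> float:
    """Return weight storage in GB (1 GB = 1e9 bytes), ignoring runtime overhead."""
    return params * bits / 8 / 1e9


# (nominal bits, VRAM in GB) exactly as listed in the table above
LISTED = {"Q4_K_M": (4, 41.0), "Q8_0": (8, 78.0)}

for name, (bits, listed_gb) in LISTED.items():
    weights = weight_gb(PARAMS, bits)
    # The difference is what the listed figure leaves for everything besides weights.
    print(f"{name}: weights ~{weights:.1f} GB, listed {listed_gb:.1f} GB, "
          f"difference {listed_gb - weights:.1f} GB")
```

Under these assumptions the weights come to about 36 GB at 4 bits and 72 GB at 8 bits, so the listed figures leave roughly 5 to 6 GB beyond the weights.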

About this model

Qwen 2.5 VL 72B is Alibaba's flagship vision-language model, with 72 billion parameters. It is suited to complex visual reasoning, solving mathematical problems from images, document analysis, and multilingual visual understanding. It performs strongly on benchmarks such as MathVista and DocVQA, which makes it a good fit for demanding multimodal workloads when sufficient hardware is available (at least 41.0 GB of VRAM for the Q4_K_M quantization).

Benchmarks

MMLU: 85.0

Install

Ollama

ollama run qwen2.5vl:72b-q4_K_M
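
Once the model has been pulled, it can also be queried programmatically. The following is a minimal sketch, assuming an Ollama server is running locally on its default port (11434) and using its `/api/generate` endpoint, which accepts base64-encoded images for multimodal models. The prompt and the image path in the usage comment are hypothetical placeholders.

```python
import base64
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local endpoint
MODEL = "qwen2.5vl:72b-q4_K_M"  # tag taken from the install command above


def build_payload(prompt: str, image_bytes: bytes) -> dict:
    """Build the JSON body for one non-streaming request with a single image."""
    return {
        "model": MODEL,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # return one JSON object instead of a stream of chunks
    }


def ask(prompt: str, image_path: str) -> str:
    """Send an image plus prompt to the local server and return the reply text."""
    with open(image_path, "rb") as f:
        payload = build_payload(prompt, f.read())
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


# Usage (requires a running server; "invoice.png" is a hypothetical file):
#   print(ask("Summarize this document.", "invoice.png"))
```

Keeping payload construction separate from the network call makes the request shape easy to inspect without a server.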

llama.cpp / GGUF

Download a GGUF build of the model from HuggingFace and run it with llama.cpp.

Specs

Parameters: 72B
Architecture: transformer-decoder
Context: 33K tokens
Min VRAM: 41.0 GB
Recommended: 41.0 GB
Family: Qwen 2.5
Released: 2025-01-26
License: Apache 2.0
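
To relate these requirements to a specific machine, the listed VRAM figures can be turned into a simple fit check. This is an illustrative sketch only: `pick_quant` is a hypothetical helper, the thresholds are the VRAM values listed on this page, and preferring the highest-bit quantization that fits is an assumption (the page itself marks Q4_K_M as the recommended option).

```python
from typing import Optional

# (quantization, listed VRAM in GB), ordered from most to least demanding
QUANTS = [("Q8_0", 78.0), ("Q4_K_M", 41.0)]


def pick_quant(available_vram_gb: float) -> Optional[str]:
    """Return the highest-bit quantization whose listed VRAM fits, or None."""
    for name, required_gb in QUANTS:
        if available_vram_gb >= required_gb:
            return name
    return None  # below the 41.0 GB minimum listed in the specs


for vram in (24.0, 48.0, 80.0):
    print(f"{vram:.0f} GB available -> {pick_quant(vram)}")
```

A threshold check like this ignores headroom for long contexts or other processes sharing the GPU, so it is best treated as a first filter.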