Qwen 2.5 VL 72B
by Alibaba · qwen-2.5 family
72B
parameters
text-generation vision reasoning multilingual math
Qwen 2.5 VL 72B is Alibaba's flagship vision-language model, offering top-tier multimodal performance at 72 billion parameters. It excels at complex visual reasoning, mathematical problem solving from images, document analysis, and multilingual visual understanding. This model delivers frontier-level vision capabilities with strong performance on benchmarks like MathVista and DocVQA, making it a powerful choice for demanding multimodal workloads when sufficient hardware is available.
Quick Start with Ollama
ollama run 72b-q4_K_M | Creator | Alibaba |
| Parameters | 72B |
| Architecture | transformer-decoder |
| Context | 32K tokens |
| Released | Jan 26, 2025 |
| License | Apache 2.0 |
| Ollama | qwen2.5vl:72b |
Quantization Options
| Format | File Size | VRAM Required | Quality | Ollama Tag |
|---|---|---|---|---|
| Q4_K_M rec | 36 GB | 41 GB | | 72b-q4_K_M |
| Q8_0 | 72 GB | 78 GB | | 72b-q8_0 |
Compatible Hardware
Q4_K_M requires 41 GB VRAM
Benchmark Scores
85.0
mmlu