InternLM 2.5 20B
by Shanghai AI Lab · internlm-2.5 family
20B
parameters
text-generation code-generation reasoning multilingual tool-use math creative-writing
InternLM 2.5 20B is the larger variant in Shanghai AI Lab's InternLM 2.5 series, delivering substantially improved reasoning and generation quality over the 7B model. It supports an extraordinary 1M token context length and offers strong performance across mathematical reasoning, code generation, multilingual tasks, and creative writing. With its Apache 2.0 license and robust tool-use capabilities, the 20B model is well-suited for complex agentic applications and professional workflows. It provides a compelling balance of capability and efficiency for users with mid-range to high-end hardware.
Quick Start with Ollama
ollama run 20b-q4_K_M | Creator | Shanghai AI Lab |
| Parameters | 20B |
| Architecture | transformer-decoder |
| Context | 1024K tokens |
| Released | Jul 3, 2024 |
| License | Apache 2.0 |
| Ollama | internlm2:20b |
Quantization Options
| Format | File Size | VRAM Required | Quality | Ollama Tag |
|---|---|---|---|---|
| Q4_K_M rec | 10.5 GB | 12 GB | | 20b-q4_K_M |
| Q8_0 | 18.5 GB | 22 GB | | 20b-q8_0 |
| F16 | 37 GB | 42 GB | | 20b-f16 |
Compatible Hardware
Q4_K_M requires 12 GB VRAM
Benchmark Scores
78.0
mmlu