Qwen 3 14B

Name: Qwen 3 14B
Author: Alibaba

Apache 2.0

Alibaba · 14B · transformer-decoder

🤗 HuggingFace Ollama Official

2025-04-29 131K context 14B params

Use Cases

chat code reasoning multilingual math tools writing summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	12.0 GB	Good	—
Q8_0	8	19.0 GB	Excellent	—
F16	16	33.0 GB	Excellent	—

About this model

Qwen 3 14B is one of the strongest mid-range models available, excelling at coding, math, reasoning, and creative tasks. Hybrid thinking mode lets it match the performance of much larger models on complex problems while staying fast on simple ones. At Q4 it fits on 16 GB GPUs or Macs, making it a top pick for the RTX 4060 Ti 16GB, RTX 5070 Ti, or M-series Macs with 16 GB. A strong contender for the best daily driver at this VRAM tier.

Benchmarks

80.5

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run qwen3:14b-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 14B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 12.0 GB
Recommended: 12.0 GB
Family: Qwen 3
Released: 2025-04-29
License: Apache 2.0