SmolLM2 1.7B

Name: SmolLM2 1.7B
Author: Hugging Face

1.7B

parameters

text-generation code-generation summarization

SmolLM2 1.7B is Hugging Face's compact language model designed for on-device and edge deployment. Trained on 11 trillion tokens from a diverse mix of web data, code, and mathematics datasets, it delivers surprisingly strong performance for its size. Despite being one of the smallest models available, SmolLM2 1.7B outperforms other models in its class on reasoning, knowledge, and instruction-following tasks. Its tiny VRAM footprint makes it ideal for resource-constrained environments where larger models are impractical.

Quick Start with Ollama


ollama run 1.7b-instruct-q8_0

Resources Ollama Hugging Face Official Page

Creator	Hugging Face
Parameters	1.7B
Architecture	transformer-decoder
Context	8K tokens
Released	Nov 2, 2024
License	Apache 2.0
Ollama	smollm2

Quantization Options

Format	File Size	VRAM Required	Ollama Tag
Q4_K_M	1 GB	1.9 GB	`1.7b-instruct-q4_K_M`
Q8_0 rec	1.8 GB	2.7 GB	`1.7b-instruct-q8_0`
F16	3.4 GB	4.4 GB	`1.7b-instruct-fp16`

Compatible Hardware

Q8_0 requires 2.7 GB VRAM

Compatible Hardware

Hardware	VRAM	Type	Fit	Est. Speed
Mac Studio M4 Ultra 512GB	512 GB	mac	Runs	~303 tok/s
Mac Pro M2 Ultra 192GB	192 GB	mac	Runs	~296 tok/s
Mac Studio M4 Ultra 192GB	192 GB	mac	Runs	~303 tok/s
Mac Studio M4 Max 128GB	128 GB	mac	Runs	~202 tok/s
MacBook Pro M4 Max 128GB	128 GB	mac	Runs	~202 tok/s
MacBook Pro M5 Max 128GB	128 GB	mac	Runs	~202 tok/s
NVIDIA RTX PRO 6000 Blackwell	96 GB	gpu	Runs	~711 tok/s
MacBook Pro M3 Max 96GB	96 GB	mac	Runs	~148 tok/s
Mac mini M4 Pro 64GB	64 GB	mac	Runs	~101 tok/s
Mac Studio M4 Max 64GB	64 GB	mac	Runs	~202 tok/s
MacBook Pro M4 Max 64GB	64 GB	mac	Runs	~202 tok/s
MacBook Pro M5 Max 64GB	64 GB	mac	Runs	~202 tok/s
NVIDIA RTX 6000 Ada Generation	48 GB	gpu	Runs	~356 tok/s
NVIDIA RTX A6000	48 GB	gpu	Runs	~284 tok/s
NVIDIA RTX PRO 5000 Blackwell	48 GB	gpu	Runs	~356 tok/s
Mac mini M4 Pro 48GB	48 GB	mac	Runs	~101 tok/s
MacBook Pro M3 Max 48GB	48 GB	mac	Runs	~148 tok/s
MacBook Pro M4 Max 48GB	48 GB	mac	Runs	~202 tok/s
MacBook Pro M4 Pro 48GB	48 GB	mac	Runs	~101 tok/s
MacBook Pro M5 Max 48GB	48 GB	mac	Runs	~152 tok/s
MacBook Pro M5 Pro 48GB	48 GB	mac	Runs	~101 tok/s
Mac Studio M4 Max 36GB	36 GB	mac	Runs	~202 tok/s
MacBook Pro M3 Pro 36GB	36 GB	mac	Runs	~56 tok/s
MacBook Pro M5 Max 36GB	36 GB	mac	Runs	~152 tok/s
NVIDIA RTX 5000 Ada Generation	32 GB	gpu	Runs	~267 tok/s
NVIDIA GeForce RTX 5090	32 GB	gpu	Runs	~664 tok/s
iMac M4 32GB	32 GB	mac	Runs	~44 tok/s
Mac mini M4 32GB	32 GB	mac	Runs	~44 tok/s
MacBook Air M5 32GB	32 GB	mac	Runs	~44 tok/s
MacBook Air M4 32GB	32 GB	mac	Runs	~44 tok/s
MacBook Pro M5 32GB	32 GB	mac	Runs	~44 tok/s
AMD Radeon RX 7900 XTX	24 GB	gpu	Runs	~356 tok/s
NVIDIA GeForce RTX 3090 Ti	24 GB	gpu	Runs	~373 tok/s
NVIDIA GeForce RTX 3090	24 GB	gpu	Runs	~347 tok/s
NVIDIA GeForce RTX 4090	24 GB	gpu	Runs	~373 tok/s
NVIDIA RTX A5000	24 GB	gpu	Runs	~284 tok/s
iMac M3 24GB	24 GB	mac	Runs	~37 tok/s
Mac mini M2 24GB	24 GB	mac	Runs	~37 tok/s
Mac mini M4 Pro 24GB	24 GB	mac	Runs	~101 tok/s
MacBook Air M2 24GB	24 GB	mac	Runs	~37 tok/s
MacBook Air M4 24GB	24 GB	mac	Runs	~44 tok/s
MacBook Air M5 24GB	24 GB	mac	Runs	~44 tok/s
MacBook Pro M4 Pro 24GB	24 GB	mac	Runs	~101 tok/s
MacBook Pro M5 24GB	24 GB	mac	Runs	~44 tok/s
MacBook Pro M5 Pro 24GB	24 GB	mac	Runs	~101 tok/s
AMD Radeon RX 7900 XT	20 GB	gpu	Runs	~296 tok/s
NVIDIA RTX 4000 Ada Generation	20 GB	gpu	Runs	~133 tok/s
MacBook Pro M3 Pro 18GB	18 GB	mac	Runs	~56 tok/s
AMD Radeon RX 6800 XT	16 GB	gpu	Runs	~190 tok/s
AMD Radeon RX 6900 XT	16 GB	gpu	Runs	~190 tok/s
AMD Radeon RX 7800 XT	16 GB	gpu	Runs	~231 tok/s
AMD Radeon RX 9060 XT 16GB	16 GB	gpu	Runs	~199 tok/s
AMD Radeon RX 9070 XT	16 GB	gpu	Runs	~241 tok/s
AMD Radeon RX 9070	16 GB	gpu	Runs	~199 tok/s
Intel Arc A770	16 GB	gpu	Runs	~207 tok/s
NVIDIA GeForce RTX 4060 Ti 16GB	16 GB	gpu	Runs	~107 tok/s
NVIDIA GeForce RTX 4070 Ti Super	16 GB	gpu	Runs	~249 tok/s
NVIDIA GeForce RTX 4080 Super	16 GB	gpu	Runs	~273 tok/s
NVIDIA GeForce RTX 4080	16 GB	gpu	Runs	~266 tok/s
NVIDIA GeForce RTX 5060 Ti 16GB	16 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 5070 Ti	16 GB	gpu	Runs	~332 tok/s
NVIDIA GeForce RTX 5080	16 GB	gpu	Runs	~356 tok/s
NVIDIA RTX A4000	16 GB	gpu	Runs	~166 tok/s
iMac M1 16GB	16 GB	mac	Runs	~25 tok/s
iMac M4 16GB	16 GB	mac	Runs	~44 tok/s
Mac mini M1 16GB	16 GB	mac	Runs	~25 tok/s
Mac mini M4 16GB	16 GB	mac	Runs	~44 tok/s
MacBook Air M2 16GB	16 GB	mac	Runs	~37 tok/s
MacBook Air M4 16GB	16 GB	mac	Runs	~44 tok/s
MacBook Air M3 16GB	16 GB	mac	Runs	~37 tok/s
MacBook Air M5 16GB	16 GB	mac	Runs	~44 tok/s
MacBook Pro M2 Pro 16GB	16 GB	mac	Runs	~74 tok/s
MacBook Pro M1 16GB	16 GB	mac	Runs	~25 tok/s
MacBook Pro M5 16GB	16 GB	mac	Runs	~44 tok/s
AMD Radeon RX 6700 XT	12 GB	gpu	Runs	~142 tok/s
AMD Radeon RX 7700 XT	12 GB	gpu	Runs	~160 tok/s
Intel Arc B580	12 GB	gpu	Runs	~169 tok/s
NVIDIA GeForce RTX 3060 12GB	12 GB	gpu	Runs	~133 tok/s
NVIDIA GeForce RTX 3080 12GB	12 GB	gpu	Runs	~338 tok/s
NVIDIA GeForce RTX 4070 Super	12 GB	gpu	Runs	~187 tok/s
NVIDIA GeForce RTX 4070 Ti	12 GB	gpu	Runs	~187 tok/s
NVIDIA GeForce RTX 4070	12 GB	gpu	Runs	~187 tok/s
NVIDIA GeForce RTX 5070	12 GB	gpu	Runs	~249 tok/s
NVIDIA GeForce GTX 1080 Ti	11 GB	gpu	Runs	~179 tok/s
NVIDIA GeForce RTX 2080 Ti	11 GB	gpu	Runs	~228 tok/s
Intel Arc B570	10 GB	gpu	Runs	~141 tok/s
NVIDIA GeForce RTX 3080 10GB	10 GB	gpu	Runs	~281 tok/s
AMD Radeon RX 6600 XT	8 GB	gpu	Runs	~95 tok/s
AMD Radeon RX 7600	8 GB	gpu	Runs	~107 tok/s
AMD Radeon RX 9060 XT 8GB	8 GB	gpu	Runs	~100 tok/s
Intel Arc A750	8 GB	gpu	Runs	~190 tok/s
NVIDIA GeForce GTX 1070	8 GB	gpu	Runs	~95 tok/s
NVIDIA GeForce RTX 2060 Super	8 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 2070 Super	8 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 2080 Super	8 GB	gpu	Runs	~184 tok/s
NVIDIA GeForce RTX 3050	8 GB	gpu	Runs	~83 tok/s
NVIDIA GeForce RTX 3060 Ti	8 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 3070	8 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 4060 Ti 8GB	8 GB	gpu	Runs	~107 tok/s
NVIDIA GeForce RTX 4060	8 GB	gpu	Runs	~101 tok/s
NVIDIA GeForce RTX 5050	8 GB	gpu	Runs	~83 tok/s
NVIDIA GeForce RTX 5060 Ti 8GB	8 GB	gpu	Runs	~166 tok/s
NVIDIA GeForce RTX 5060	8 GB	gpu	Runs	~124 tok/s
MacBook Air M1 8GB	8 GB	mac	Runs	~25 tok/s
MacBook Air M2 8GB	8 GB	mac	Runs	~37 tok/s
NVIDIA GeForce GTX 1660 Super	6 GB	gpu	Runs	~124 tok/s
NVIDIA GeForce RTX 2060	6 GB	gpu	Runs	~124 tok/s