Falcon 3 10B

by TII · falcon family

10B parameters

Tags: text-generation, code-generation, reasoning, multilingual

Falcon 3 10B is the mid-size variant of the third-generation Falcon language model family from the Technology Innovation Institute (TII). With 10B parameters, it is positioned above the 7B variant while remaining practical for local deployment on consumer hardware: the recommended Q4_K_M quantization needs about 8.5 GB of VRAM. It offers multilingual support and reasoning capabilities, making it a general-purpose choice for text generation and code tasks.

Quick Start with Ollama

ollama run falcon3:10b
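
Beyond the interactive CLI, a running Ollama server can also be queried over its local REST API. The sketch below is a minimal, non-authoritative example that assumes Ollama is serving on its default port (11434) and that the `falcon3:10b` tag listed on this page has already been pulled. The helper names `build_request` and `generate` are illustrative, not part of any library.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Model tag as listed in the spec table on this page.
MODEL_TAG = "falcon3:10b"


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG, timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    return body["response"]


# Usage (requires `ollama serve` and a pulled model):
# print(generate("Explain quantization in one sentence."))
```

Setting `"stream": False` asks the server for a single JSON object instead of a stream of partial chunks, which keeps the parsing to one `json.loads` call.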
Resources: Ollama, Hugging Face, Official Page
Creator: TII
Parameters: 10B
Architecture: transformer-decoder
Context: 32K tokens
Released: Dec 11, 2024
License: TII Falcon License 2.0
Ollama: falcon3:10b

Quantization Options

Format                 File Size  VRAM Required  Ollama Tag
Q4_K_M (recommended)   6 GB       8.5 GB         10b-q4_K_M
Q8_0                   10.5 GB    13 GB          10b-q8_0
F16                    20 GB      24 GB          10b-fp16
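
The file sizes above can be sanity-checked with simple arithmetic: dividing the file size in bits by the parameter count gives the effective bits per weight, and subtracting the file size from the VRAM figure gives the headroom the table budgets for context and runtime buffers. A rough sketch, assuming exactly 10 billion parameters and 1 GB = 10^9 bytes (neither is stated on this page); the function names are illustrative:

```python
# Assumption: exactly 10 billion parameters, and GB meaning 10^9 bytes.
PARAMS_BILLION = 10.0

# (file size in GB, VRAM required in GB), copied from the quantization table.
QUANTS = {
    "Q4_K_M": (6.0, 8.5),
    "Q8_0": (10.5, 13.0),
    "F16": (20.0, 24.0),
}


def bits_per_weight(file_size_gb: float, params_billion: float = PARAMS_BILLION) -> float:
    """Effective bits stored per parameter: (GB * 8 bits) / billions of parameters."""
    return file_size_gb * 8.0 / params_billion


def headroom_gb(file_size_gb: float, vram_gb: float) -> float:
    """VRAM budgeted beyond the weights themselves (context, runtime buffers)."""
    return vram_gb - file_size_gb


for name, (size_gb, vram_gb) in QUANTS.items():
    print(
        f"{name}: {bits_per_weight(size_gb):.1f} bits/weight, "
        f"{headroom_gb(size_gb, vram_gb):.1f} GB headroom"
    )
```

Under these assumptions, Q4_K_M works out to 4.8 bits per weight, Q8_0 to 8.4, and F16 to 16.0, with 2.5 GB, 2.5 GB, and 4.0 GB of headroom respectively.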

Compatible Hardware

Q4_K_M requires 8.5 GB VRAM

Hardware                          VRAM    Type  Fit          Est. Speed
Mac Studio M4 Ultra 512GB         512 GB  mac   Runs         ~96 tok/s
Mac Pro M2 Ultra 192GB            192 GB  mac   Runs         ~94 tok/s
Mac Studio M4 Ultra 192GB         192 GB  mac   Runs         ~96 tok/s
Mac Studio M4 Max 128GB           128 GB  mac   Runs         ~64 tok/s
MacBook Pro M4 Max 128GB          128 GB  mac   Runs         ~64 tok/s
MacBook Pro M5 Max 128GB          128 GB  mac   Runs         ~64 tok/s
NVIDIA RTX PRO 6000 Blackwell     96 GB   gpu   Runs         ~226 tok/s
MacBook Pro M3 Max 96GB           96 GB   mac   Runs         ~47 tok/s
Mac mini M4 Pro 64GB              64 GB   mac   Runs         ~32 tok/s
Mac Studio M4 Max 64GB            64 GB   mac   Runs         ~64 tok/s
MacBook Pro M4 Max 64GB           64 GB   mac   Runs         ~64 tok/s
MacBook Pro M5 Max 64GB           64 GB   mac   Runs         ~64 tok/s
NVIDIA RTX 6000 Ada Generation    48 GB   gpu   Runs         ~113 tok/s
NVIDIA RTX A6000                  48 GB   gpu   Runs         ~90 tok/s
NVIDIA RTX PRO 5000 Blackwell     48 GB   gpu   Runs         ~113 tok/s
Mac mini M4 Pro 48GB              48 GB   mac   Runs         ~32 tok/s
MacBook Pro M3 Max 48GB           48 GB   mac   Runs         ~47 tok/s
MacBook Pro M4 Pro 48GB           48 GB   mac   Runs         ~32 tok/s
MacBook Pro M4 Max 48GB           48 GB   mac   Runs         ~64 tok/s
MacBook Pro M5 Max 48GB           48 GB   mac   Runs         ~48 tok/s
MacBook Pro M5 Pro 48GB           48 GB   mac   Runs         ~32 tok/s
Mac Studio M4 Max 36GB            36 GB   mac   Runs         ~64 tok/s
MacBook Pro M3 Pro 36GB           36 GB   mac   Runs         ~18 tok/s
MacBook Pro M5 Max 36GB           36 GB   mac   Runs         ~48 tok/s
NVIDIA RTX 5000 Ada Generation    32 GB   gpu   Runs         ~85 tok/s
NVIDIA GeForce RTX 5090           32 GB   gpu   Runs         ~211 tok/s
iMac M4 32GB                      32 GB   mac   Runs         ~14 tok/s
Mac mini M4 32GB                  32 GB   mac   Runs         ~14 tok/s
MacBook Air M4 32GB               32 GB   mac   Runs         ~14 tok/s
MacBook Air M5 32GB               32 GB   mac   Runs         ~14 tok/s
MacBook Pro M5 32GB               32 GB   mac   Runs         ~14 tok/s
AMD Radeon RX 7900 XTX            24 GB   gpu   Runs         ~113 tok/s
NVIDIA GeForce RTX 3090 Ti        24 GB   gpu   Runs         ~119 tok/s
NVIDIA GeForce RTX 3090           24 GB   gpu   Runs         ~110 tok/s
NVIDIA GeForce RTX 4090           24 GB   gpu   Runs         ~119 tok/s
NVIDIA RTX A5000                  24 GB   gpu   Runs         ~90 tok/s
iMac M3 24GB                      24 GB   mac   Runs         ~12 tok/s
Mac mini M2 24GB                  24 GB   mac   Runs         ~12 tok/s
Mac mini M4 Pro 24GB              24 GB   mac   Runs         ~32 tok/s
MacBook Air M2 24GB               24 GB   mac   Runs         ~12 tok/s
MacBook Air M4 24GB               24 GB   mac   Runs         ~14 tok/s
MacBook Air M5 24GB               24 GB   mac   Runs         ~14 tok/s
MacBook Pro M4 Pro 24GB           24 GB   mac   Runs         ~32 tok/s
MacBook Pro M5 24GB               24 GB   mac   Runs         ~14 tok/s
MacBook Pro M5 Pro 24GB           24 GB   mac   Runs         ~32 tok/s
AMD Radeon RX 7900 XT             20 GB   gpu   Runs         ~94 tok/s
NVIDIA RTX 4000 Ada Generation    20 GB   gpu   Runs         ~42 tok/s
MacBook Pro M3 Pro 18GB           18 GB   mac   Runs         ~18 tok/s
AMD Radeon RX 6800 XT             16 GB   gpu   Runs         ~60 tok/s
AMD Radeon RX 6900 XT             16 GB   gpu   Runs         ~60 tok/s
AMD Radeon RX 7800 XT             16 GB   gpu   Runs         ~73 tok/s
AMD Radeon RX 9060 XT 16GB        16 GB   gpu   Runs         ~63 tok/s
AMD Radeon RX 9070 XT             16 GB   gpu   Runs         ~76 tok/s
AMD Radeon RX 9070                16 GB   gpu   Runs         ~63 tok/s
Intel Arc A770                    16 GB   gpu   Runs         ~66 tok/s
NVIDIA GeForce RTX 4060 Ti 16GB   16 GB   gpu   Runs         ~34 tok/s
NVIDIA GeForce RTX 4070 Ti Super  16 GB   gpu   Runs         ~79 tok/s
NVIDIA GeForce RTX 4080 Super     16 GB   gpu   Runs         ~87 tok/s
NVIDIA GeForce RTX 4080           16 GB   gpu   Runs         ~84 tok/s
NVIDIA GeForce RTX 5060 Ti 16GB   16 GB   gpu   Runs         ~53 tok/s
NVIDIA GeForce RTX 5070 Ti        16 GB   gpu   Runs         ~105 tok/s
NVIDIA GeForce RTX 5080           16 GB   gpu   Runs         ~113 tok/s
NVIDIA RTX A4000                  16 GB   gpu   Runs         ~53 tok/s
iMac M1 16GB                      16 GB   mac   Runs         ~8 tok/s
iMac M4 16GB                      16 GB   mac   Runs         ~14 tok/s
Mac mini M1 16GB                  16 GB   mac   Runs         ~8 tok/s
Mac mini M4 16GB                  16 GB   mac   Runs         ~14 tok/s
MacBook Air M2 16GB               16 GB   mac   Runs         ~12 tok/s
MacBook Air M3 16GB               16 GB   mac   Runs         ~12 tok/s
MacBook Air M4 16GB               16 GB   mac   Runs         ~14 tok/s
MacBook Air M5 16GB               16 GB   mac   Runs         ~14 tok/s
MacBook Pro M1 16GB               16 GB   mac   Runs         ~8 tok/s
MacBook Pro M2 Pro 16GB           16 GB   mac   Runs         ~24 tok/s
MacBook Pro M5 16GB               16 GB   mac   Runs         ~14 tok/s
AMD Radeon RX 6700 XT             12 GB   gpu   Runs         ~45 tok/s
AMD Radeon RX 7700 XT             12 GB   gpu   Runs         ~51 tok/s
Intel Arc B580                    12 GB   gpu   Runs         ~54 tok/s
NVIDIA GeForce RTX 3060 12GB      12 GB   gpu   Runs         ~42 tok/s
NVIDIA GeForce RTX 3080 12GB      12 GB   gpu   Runs         ~107 tok/s
NVIDIA GeForce RTX 4070 Super     12 GB   gpu   Runs         ~59 tok/s
NVIDIA GeForce RTX 4070 Ti        12 GB   gpu   Runs         ~59 tok/s
NVIDIA GeForce RTX 4070           12 GB   gpu   Runs         ~59 tok/s
NVIDIA GeForce RTX 5070           12 GB   gpu   Runs         ~79 tok/s
NVIDIA GeForce GTX 1080 Ti        11 GB   gpu   Runs         ~57 tok/s
NVIDIA GeForce RTX 2080 Ti        11 GB   gpu   Runs         ~72 tok/s
Intel Arc B570                    10 GB   gpu   Runs         ~45 tok/s
NVIDIA GeForce RTX 3080 10GB      10 GB   gpu   Runs         ~89 tok/s
AMD Radeon RX 6600 XT             8 GB    gpu   CPU Offload  ~9 tok/s
AMD Radeon RX 7600                8 GB    gpu   CPU Offload  ~10 tok/s
Intel Arc A750                    8 GB    gpu   CPU Offload  ~18 tok/s
AMD Radeon RX 9060 XT 8GB         8 GB    gpu   CPU Offload  ~10 tok/s
NVIDIA GeForce GTX 1070           8 GB    gpu   CPU Offload  ~9 tok/s
NVIDIA GeForce RTX 2060 Super     8 GB    gpu   CPU Offload  ~16 tok/s
NVIDIA GeForce RTX 2070 Super     8 GB    gpu   CPU Offload  ~16 tok/s
NVIDIA GeForce RTX 2080 Super     8 GB    gpu   CPU Offload  ~17 tok/s
NVIDIA GeForce RTX 3050           8 GB    gpu   CPU Offload  ~8 tok/s
NVIDIA GeForce RTX 3060 Ti        8 GB    gpu   CPU Offload  ~16 tok/s
NVIDIA GeForce RTX 3070           8 GB    gpu   CPU Offload  ~16 tok/s
NVIDIA GeForce RTX 4060           8 GB    gpu   CPU Offload  ~10 tok/s
NVIDIA GeForce RTX 4060 Ti 8GB    8 GB    gpu   CPU Offload  ~10 tok/s
NVIDIA GeForce RTX 5050           8 GB    gpu   CPU Offload  ~8 tok/s
NVIDIA GeForce RTX 5060 Ti 8GB    8 GB    gpu   CPU Offload  ~16 tok/s
NVIDIA GeForce RTX 5060           8 GB    gpu   CPU Offload  ~12 tok/s
MacBook Air M1 8GB                8 GB    mac   CPU Offload  ~2 tok/s
MacBook Air M2 8GB                8 GB    mac   CPU Offload  ~4 tok/s
NVIDIA GeForce GTX 1660 Super     6 GB    gpu   CPU Offload  ~12 tok/s
NVIDIA GeForce RTX 2060           6 GB    gpu   CPU Offload  ~12 tok/s
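
The Fit column is consistent with a simple threshold: every device with at least 8.5 GB of memory is listed as Runs, and every 8 GB or 6 GB device as CPU Offload. The following sketch reproduces that rule and extends it to pick the largest quantization that fits. It is an assumption about how the labels were derived, not a documented method, and it treats a Mac's unified memory as fully available to the model, which is optimistic in practice. The function names are illustrative.

```python
from typing import Optional

# VRAM requirements in GB, copied from the quantization table on this page.
VRAM_REQUIRED_GB = {"Q4_K_M": 8.5, "Q8_0": 13.0, "F16": 24.0}


def fit_label(device_memory_gb: float, quant: str = "Q4_K_M") -> str:
    """Assumed rule: the model 'Runs' if the device memory covers the requirement."""
    if device_memory_gb >= VRAM_REQUIRED_GB[quant]:
        return "Runs"
    return "CPU Offload"


def largest_quant_that_fits(device_memory_gb: float) -> Optional[str]:
    """Return the most memory-hungry quantization that still fits, or None."""
    fitting = [q for q, need in VRAM_REQUIRED_GB.items() if need <= device_memory_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda q: VRAM_REQUIRED_GB[q])


for memory_gb in (6, 8, 12, 16, 24):
    print(memory_gb, fit_label(memory_gb), largest_quant_that_fits(memory_gb))
```

A 24 GB card such as the RTX 4090 would clear the F16 requirement exactly under this rule, leaving no margin, so a lower quantization may still be the more practical choice there.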

Benchmark Scores

MMLU: 72.0