Skip to content

Gemma 2 27B

by Google · gemma-2 family

27B

parameters

text-generation code-generation reasoning multilingual math creative-writing summarization

Gemma 2 27B is the largest model in Google's Gemma 2 family, delivering top-tier performance among open models in the 20-30B parameter range. It benefits from advanced training techniques including knowledge distillation and RLHF, resulting in highly capable and well-aligned outputs. This model offers excellent performance on reasoning, coding, and creative tasks. While it requires a capable GPU, it fits comfortably on high-end consumer cards at Q4 quantization, making it accessible to enthusiasts with 24GB VRAM GPUs.

Quick Start with Ollama

ollama run 27b-instruct-q4_K_M
Resources Ollama Hugging Face Official Page Research Paper
Creator Google
Parameters 27B
Architecture transformer-decoder
Context 8K tokens
Released Jun 27, 2024
License Gemma Terms of Use
Ollama gemma2:27b

Quantization Options

Format File Size VRAM Required Quality Ollama Tag
Q4_K_M rec 13.4 GB 17.7 GB 27b-instruct-q4_K_M
Q5_K_M 15.7 GB 20.4 GB 27b-instruct-q5_K_M
Q8_0 24.3 GB 29 GB 27b-instruct-q8_0

Compatible Hardware

Q4_K_M requires 17.7 GB VRAM

Compatible Hardware

HardwareVRAMTypeFitEst. Speed
Mac Studio M4 Ultra 512GB512 GBmacRuns~46 tok/s
Mac Pro M2 Ultra 192GB192 GBmacRuns~45 tok/s
Mac Studio M4 Ultra 192GB192 GBmacRuns~46 tok/s
Mac Studio M4 Max 128GB128 GBmacRuns~31 tok/s
MacBook Pro M4 Max 128GB128 GBmacRuns~31 tok/s
MacBook Pro M5 Max 128GB128 GBmacRuns~31 tok/s
NVIDIA RTX PRO 6000 Blackwell96 GBgpuRuns~108 tok/s
MacBook Pro M3 Max 96GB96 GBmacRuns~23 tok/s
Mac mini M4 Pro 64GB64 GBmacRuns~15 tok/s
Mac Studio M4 Max 64GB64 GBmacRuns~31 tok/s
MacBook Pro M4 Max 64GB64 GBmacRuns~31 tok/s
MacBook Pro M5 Max 64GB64 GBmacRuns~31 tok/s
NVIDIA RTX 6000 Ada Generation48 GBgpuRuns~54 tok/s
NVIDIA RTX A600048 GBgpuRuns~43 tok/s
NVIDIA RTX PRO 5000 Blackwell48 GBgpuRuns~54 tok/s
Mac mini M4 Pro 48GB48 GBmacRuns~15 tok/s
MacBook Pro M3 Max 48GB48 GBmacRuns~23 tok/s
MacBook Pro M4 Pro 48GB48 GBmacRuns~15 tok/s
MacBook Pro M4 Max 48GB48 GBmacRuns~31 tok/s
MacBook Pro M5 Max 48GB48 GBmacRuns~23 tok/s
MacBook Pro M5 Pro 48GB48 GBmacRuns~15 tok/s
Mac Studio M4 Max 36GB36 GBmacRuns~31 tok/s
MacBook Pro M3 Pro 36GB36 GBmacRuns~8 tok/s
MacBook Pro M5 Max 36GB36 GBmacRuns~23 tok/s
NVIDIA RTX 5000 Ada Generation32 GBgpuRuns~41 tok/s
NVIDIA GeForce RTX 509032 GBgpuRuns~101 tok/s
iMac M4 32GB32 GBmacRuns~7 tok/s
Mac mini M4 32GB32 GBmacRuns~7 tok/s
MacBook Air M4 32GB32 GBmacRuns~7 tok/s
MacBook Air M5 32GB32 GBmacRuns~7 tok/s
MacBook Pro M5 32GB32 GBmacRuns~7 tok/s
AMD Radeon RX 7900 XTX24 GBgpuRuns~54 tok/s
NVIDIA GeForce RTX 3090 Ti24 GBgpuRuns~57 tok/s
NVIDIA GeForce RTX 309024 GBgpuRuns~53 tok/s
NVIDIA GeForce RTX 409024 GBgpuRuns~57 tok/s
NVIDIA RTX A500024 GBgpuRuns~43 tok/s
iMac M3 24GB24 GBmacRuns~6 tok/s
Mac mini M2 24GB24 GBmacRuns~6 tok/s
Mac mini M4 Pro 24GB24 GBmacRuns~15 tok/s
MacBook Air M2 24GB24 GBmacRuns~6 tok/s
MacBook Air M4 24GB24 GBmacRuns~7 tok/s
MacBook Air M5 24GB24 GBmacRuns~7 tok/s
MacBook Pro M4 Pro 24GB24 GBmacRuns~15 tok/s
MacBook Pro M5 24GB24 GBmacRuns~7 tok/s
MacBook Pro M5 Pro 24GB24 GBmacRuns~15 tok/s
AMD Radeon RX 7900 XT20 GBgpuRuns (tight)~45 tok/s
NVIDIA RTX 4000 Ada Generation20 GBgpuRuns (tight)~20 tok/s
MacBook Pro M3 Pro 18GB18 GBmacCPU Offload~2 tok/s
AMD Radeon RX 6800 XT16 GBgpuCPU Offload~9 tok/s
AMD Radeon RX 6900 XT16 GBgpuCPU Offload~9 tok/s
AMD Radeon RX 7800 XT16 GBgpuCPU Offload~11 tok/s
AMD Radeon RX 9060 XT 16GB16 GBgpuCPU Offload~9 tok/s
AMD Radeon RX 9070 XT16 GBgpuCPU Offload~11 tok/s
AMD Radeon RX 907016 GBgpuCPU Offload~9 tok/s
Intel Arc A77016 GBgpuCPU Offload~10 tok/s
NVIDIA GeForce RTX 4060 Ti 16GB16 GBgpuCPU Offload~5 tok/s
NVIDIA GeForce RTX 4070 Ti Super16 GBgpuCPU Offload~11 tok/s
NVIDIA GeForce RTX 4080 Super16 GBgpuCPU Offload~13 tok/s
NVIDIA GeForce RTX 408016 GBgpuCPU Offload~12 tok/s
NVIDIA GeForce RTX 5060 Ti 16GB16 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 5070 Ti16 GBgpuCPU Offload~15 tok/s
NVIDIA GeForce RTX 508016 GBgpuCPU Offload~16 tok/s
NVIDIA RTX A400016 GBgpuCPU Offload~8 tok/s
iMac M1 16GB16 GBmacCPU Offload~1 tok/s
iMac M4 16GB16 GBmacCPU Offload~2 tok/s
Mac mini M1 16GB16 GBmacCPU Offload~1 tok/s
Mac mini M4 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M2 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M3 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M4 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M5 16GB16 GBmacCPU Offload~2 tok/s
MacBook Pro M1 16GB16 GBmacCPU Offload~1 tok/s
MacBook Pro M2 Pro 16GB16 GBmacCPU Offload~3 tok/s
MacBook Pro M5 16GB16 GBmacCPU Offload~2 tok/s
AMD Radeon RX 6700 XT12 GBgpuCPU Offload~7 tok/s
AMD Radeon RX 7700 XT12 GBgpuCPU Offload~7 tok/s
Intel Arc B58012 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 3060 12GB12 GBgpuCPU Offload~6 tok/s
NVIDIA GeForce RTX 3080 12GB12 GBgpuCPU Offload~16 tok/s
NVIDIA GeForce RTX 4070 Super12 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 4070 Ti12 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 407012 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 507012 GBgpuCPU Offload~11 tok/s
24 hardware device(s) cannot run this model at Q4_K_M.

Benchmark Scores

75.2
mmlu