Skip to content

Cogito 32B

by Deep Cogito · cogito family

32B

parameters

text-generation code-generation reasoning math multilingual

Cogito 32B is the mid-size variant from Deep Cogito, built on a Qwen 32B base with enhanced reasoning and multilingual capabilities. It delivers strong performance across mathematical reasoning, code generation, and complex problem-solving tasks. With its 131K context window and competitive benchmark scores, Cogito 32B offers an excellent balance of capability and efficiency for users who need more power than the 8B variant but want to stay within consumer GPU memory limits.

Quick Start with Ollama

ollama run 32b-q4_K_M
Resources Ollama Hugging Face Official Page
Creator Deep Cogito
Parameters 32B
Architecture transformer-decoder
Context 128K tokens
Released Mar 24, 2025
License MIT
Ollama cogito:32b

Quantization Options

Format File Size VRAM Required Quality Ollama Tag
Q4_K_M rec 18.5 GB 21.5 GB 32b-q4_K_M
Q8_0 34 GB 37 GB 32b-q8_0
F16 64 GB 68 GB 32b-fp16

Compatible Hardware

Q4_K_M requires 21.5 GB VRAM

Compatible Hardware

HardwareVRAMTypeFitEst. Speed
Mac Studio M4 Ultra 512GB512 GBmacRuns~38 tok/s
Mac Pro M2 Ultra 192GB192 GBmacRuns~37 tok/s
Mac Studio M4 Ultra 192GB192 GBmacRuns~38 tok/s
Mac Studio M4 Max 128GB128 GBmacRuns~25 tok/s
MacBook Pro M4 Max 128GB128 GBmacRuns~25 tok/s
MacBook Pro M5 Max 128GB128 GBmacRuns~25 tok/s
NVIDIA RTX PRO 6000 Blackwell96 GBgpuRuns~89 tok/s
MacBook Pro M3 Max 96GB96 GBmacRuns~19 tok/s
Mac mini M4 Pro 64GB64 GBmacRuns~13 tok/s
Mac Studio M4 Max 64GB64 GBmacRuns~25 tok/s
MacBook Pro M4 Max 64GB64 GBmacRuns~25 tok/s
MacBook Pro M5 Max 64GB64 GBmacRuns~25 tok/s
NVIDIA RTX 6000 Ada Generation48 GBgpuRuns~45 tok/s
NVIDIA RTX A600048 GBgpuRuns~36 tok/s
NVIDIA RTX PRO 5000 Blackwell48 GBgpuRuns~45 tok/s
Mac mini M4 Pro 48GB48 GBmacRuns~13 tok/s
MacBook Pro M3 Max 48GB48 GBmacRuns~19 tok/s
MacBook Pro M4 Pro 48GB48 GBmacRuns~13 tok/s
MacBook Pro M4 Max 48GB48 GBmacRuns~25 tok/s
MacBook Pro M5 Max 48GB48 GBmacRuns~19 tok/s
MacBook Pro M5 Pro 48GB48 GBmacRuns~13 tok/s
Mac Studio M4 Max 36GB36 GBmacRuns~25 tok/s
MacBook Pro M3 Pro 36GB36 GBmacRuns~7 tok/s
MacBook Pro M5 Max 36GB36 GBmacRuns~19 tok/s
NVIDIA RTX 5000 Ada Generation32 GBgpuRuns~34 tok/s
NVIDIA GeForce RTX 509032 GBgpuRuns~83 tok/s
iMac M4 32GB32 GBmacRuns~6 tok/s
Mac mini M4 32GB32 GBmacRuns~6 tok/s
MacBook Air M4 32GB32 GBmacRuns~6 tok/s
MacBook Air M5 32GB32 GBmacRuns~6 tok/s
MacBook Pro M5 32GB32 GBmacRuns~6 tok/s
AMD Radeon RX 7900 XTX24 GBgpuRuns (tight)~45 tok/s
NVIDIA GeForce RTX 3090 Ti24 GBgpuRuns (tight)~47 tok/s
NVIDIA GeForce RTX 309024 GBgpuRuns (tight)~44 tok/s
NVIDIA GeForce RTX 409024 GBgpuRuns (tight)~47 tok/s
NVIDIA RTX A500024 GBgpuRuns (tight)~36 tok/s
iMac M3 24GB24 GBmacRuns (tight)~5 tok/s
Mac mini M2 24GB24 GBmacRuns (tight)~5 tok/s
Mac mini M4 Pro 24GB24 GBmacRuns (tight)~13 tok/s
MacBook Air M2 24GB24 GBmacRuns (tight)~5 tok/s
MacBook Air M4 24GB24 GBmacRuns (tight)~6 tok/s
MacBook Air M5 24GB24 GBmacRuns (tight)~6 tok/s
MacBook Pro M4 Pro 24GB24 GBmacRuns (tight)~13 tok/s
MacBook Pro M5 24GB24 GBmacRuns (tight)~6 tok/s
MacBook Pro M5 Pro 24GB24 GBmacRuns (tight)~13 tok/s
AMD Radeon RX 7900 XT20 GBgpuCPU Offload~11 tok/s
NVIDIA RTX 4000 Ada Generation20 GBgpuCPU Offload~5 tok/s
MacBook Pro M3 Pro 18GB18 GBmacCPU Offload~2 tok/s
AMD Radeon RX 6800 XT16 GBgpuCPU Offload~7 tok/s
AMD Radeon RX 6900 XT16 GBgpuCPU Offload~7 tok/s
AMD Radeon RX 7800 XT16 GBgpuCPU Offload~9 tok/s
AMD Radeon RX 9060 XT 16GB16 GBgpuCPU Offload~8 tok/s
AMD Radeon RX 9070 XT16 GBgpuCPU Offload~9 tok/s
AMD Radeon RX 907016 GBgpuCPU Offload~8 tok/s
Intel Arc A77016 GBgpuCPU Offload~8 tok/s
NVIDIA GeForce RTX 4060 Ti 16GB16 GBgpuCPU Offload~4 tok/s
NVIDIA GeForce RTX 4070 Ti Super16 GBgpuCPU Offload~9 tok/s
NVIDIA GeForce RTX 4080 Super16 GBgpuCPU Offload~10 tok/s
NVIDIA GeForce RTX 408016 GBgpuCPU Offload~10 tok/s
NVIDIA GeForce RTX 5060 Ti 16GB16 GBgpuCPU Offload~6 tok/s
NVIDIA GeForce RTX 5070 Ti16 GBgpuCPU Offload~13 tok/s
NVIDIA GeForce RTX 508016 GBgpuCPU Offload~14 tok/s
NVIDIA RTX A400016 GBgpuCPU Offload~6 tok/s
iMac M1 16GB16 GBmacCPU Offload~1 tok/s
iMac M4 16GB16 GBmacCPU Offload~2 tok/s
Mac mini M1 16GB16 GBmacCPU Offload~1 tok/s
Mac mini M4 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M2 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M3 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M4 16GB16 GBmacCPU Offload~2 tok/s
MacBook Air M5 16GB16 GBmacCPU Offload~2 tok/s
MacBook Pro M1 16GB16 GBmacCPU Offload~1 tok/s
MacBook Pro M2 Pro 16GB16 GBmacCPU Offload~3 tok/s
MacBook Pro M5 16GB16 GBmacCPU Offload~2 tok/s
33 hardware device(s) cannot run this model at Q4_K_M.

Benchmark Scores

83.0
mmlu