Cogito 70B

Name: Cogito 70B
Author: Deep Cogito

MIT

Deep Cogito · 70B · transformer-decoder

🤗 HuggingFace Ollama Official

2025-03-24 131K context 70B params

Use Cases

chat code reasoning math multilingual

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	43.0 GB	Good	—
Q8_0	8	76.0 GB	Good	—

About this model

Cogito 70B is the flagship model from Deep Cogito, built on a Llama 70B base with significantly enhanced reasoning capabilities. It achieves top-tier benchmark results while maintaining the MIT license for maximum flexibility. With an MMLU score of 86.0 and a 131K context window, Cogito 70B is one of the strongest open-weight reasoning models available, though it requires high-end hardware with at least 43GB of VRAM for the Q4 quantization.

Benchmarks

86.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run cogito:70b-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 70B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 43.0 GB
Recommended: 43.0 GB
Family: Cogito
Released: 2025-03-24
License: MIT