Command R 35B

Name: Command R 35B
Author: Cohere

CC-BY-NC-4.0

Cohere · 35B · transformer-decoder

🤗 HuggingFace Ollama Official

2024-03-11 131K context 35B params

Use Cases

chat reasoning multilingual tools summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	22.5 GB	Good	—
Q5_K_M	5	26.0 GB	Good	—
Q8_0	8	37.0 GB	Excellent	—

About this model

Command R 35B is Cohere's open-weight model optimized for retrieval-augmented generation (RAG), tool use, and enterprise workflows. It supports 128K context and 10 languages, with particular strength in grounded generation that cites sources accurately. This model stands out for its native tool-use capabilities and reliable instruction following. It is well-suited for building AI applications that need to interact with external APIs, databases, and search systems while maintaining factual accuracy.

Benchmarks

75.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run command-r:35b-v0.1-q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 35B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 22.5 GB
Recommended: 22.5 GB
Family: Command R
Released: 2024-03-11
License: CC-BY-NC-4.0