Command R+ 104B

Name: Command R+ 104B
Author: Cohere

CC-BY-NC-4.0

Cohere · 104B · transformer-decoder

🤗 HuggingFace Ollama Official

2024-04-04 131K context 104B params

Use Cases

chat code reasoning multilingual tools summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_Mrec	4	57.0 GB	Good	—
Q8_0	8	110.0 GB	Excellent	—

About this model

Command R+ 104B is Cohere's largest open-weight model, designed for enterprise retrieval-augmented generation, tool use, and multilingual workflows. It builds on Command R's strengths with significantly improved reasoning, accuracy, and instruction following at 104B parameters. The model supports 128K context and excels at grounded generation with source citations, multi-step tool use, and complex document analysis across 10+ languages. Its size requires substantial hardware but delivers near-frontier performance for RAG and agentic use cases.

Benchmarks

75.7

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run command-r-plus:q4_K_M

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 104B
Architecture: transformer-decoder
Context: 131K tokens
Min VRAM: 57.0 GB
Recommended: 57.0 GB
Family: Command R
Released: 2024-04-04
License: CC-BY-NC-4.0