Skip to content

Nemotron 3 Nano 8B

NVIDIA Open Model License

NVIDIA · 8B · transformer-decoder

2025-03-18 131K context 8B params

Use Cases

chat code reasoning math tools

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec47.5 GBGood
Q8_0811.0 GBGood
F161619.5 GBExcellent

About this model

Nemotron 3 Nano 8B is NVIDIA's compact language model optimized for efficient inference with built-in tool-use capabilities. It delivers strong performance on reasoning, code generation, and mathematical tasks while supporting function calling out of the box. Designed for practical deployment scenarios, Nemotron 3 Nano combines a 131K context window with an 8B parameter count, making it suitable for running locally on consumer GPUs while retaining the ability to interact with external tools and APIs.