NVIDIA-Nemotron-Nano-9B-v2

Name: NVIDIA-Nemotron-Nano-9B-v2
Author: NVIDIA

chat

Parameters: 8.9B
Context length: 128K
HF downloads: 381K
Architecture: Dense
Released: 2025-08-12
Layers:
KV Heads:
Head Dim: 128
Family: nemotron

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M	4.89	5.9 GB	good
Q5_K_S	5.57	6.7 GB	good
Q5_K_M	5.7	6.8 GB	good
Q6_K	6.56	7.8 GB	excellent
Q8_0	8.5	9.9 GB	lossless
FP16	16	18.3 GB	lossless
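
As a rough cross-check of the table, the weights-only footprint can be estimated as parameters × bits per weight ÷ 8. The Python sketch below is an illustration, not the page's own method: the page does not say how it derives the VRAM column, the 8.9B parameter count is rounded, and GB is assumed to mean 10^9 bytes. Under those assumptions the listed VRAM comes out roughly 0.45 to 0.5 GB above the weights-only estimate in every row, which would be consistent with a fixed allowance for KV cache and runtime buffers.

```python
# Rows copied from the table above: name -> (bits per weight, listed VRAM in GB).
QUANTS = {
    "Q4_K_M": (4.89, 5.9),
    "Q5_K_S": (5.57, 6.7),
    "Q5_K_M": (5.7, 6.8),
    "Q6_K": (6.56, 7.8),
    "Q8_0": (8.5, 9.9),
    "FP16": (16.0, 18.3),
}

PARAMS = 8.9e9  # parameter count from the model card (rounded)


def weight_gb(params: float, bits_per_weight: float) -> float:
    """Weights-only size in decimal gigabytes (assumes 1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


for name, (bits, listed_gb) in QUANTS.items():
    weights = weight_gb(PARAMS, bits)
    # The gap is whatever the page budgets beyond the raw weights
    # (assumed here to be KV cache, activations, and runtime buffers).
    gap = listed_gb - weights
    print(f"{name:7s} weights ~{weights:5.2f} GB  listed {listed_gb:5.1f} GB  gap {gap:4.2f} GB")
```

The same function can be reused for any other quantization by passing its bits-per-weight figure; the result is a lower bound on memory use, not a guarantee that the model fits.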

Benchmarks (6)

Benchmark	Score
MATH-500	97.8
IFEval	90.3
AIME	72.1
LiveCodeBench	71.1
GPQA Diamond	64.5
HLE	6.5

Run this model

Easiest way to get started

$ curl -fsSL https://ollama.com/install.sh | sh

$ ollama run nemotron:8b-q4_k_m

The model downloads on first run and then starts automatically. Add --verbose to print speed statistics.
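
Once the model is running, it can also be queried from a script through Ollama's local HTTP API. The Python sketch below uses only the standard library and assumes the server is listening on Ollama's default address (http://localhost:11434) and that the model tag is the one used in the command above; both are assumptions about your setup, not something this page states. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_TAG = "nemotron:8b-q4_k_m"  # tag copied from the run command above


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def tokens_per_second(reply: dict) -> float:
    """Decode speed from the reply's timing fields (eval_duration is in nanoseconds)."""
    return reply["eval_count"] / (reply["eval_duration"] / 1e9)


def generate(prompt: str) -> dict:
    """Send the prompt and return the parsed JSON reply (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.load(resp)
```

With the server running, `reply = generate("Why is the sky blue?")` returns a dictionary whose `response` field holds the generated text, and `tokens_per_second(reply)` gives a speed figure comparable to what --verbose prints.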

GPUs that can run this model

Compatibility is shown at Q4_K_M quantization, sorted by minimum VRAM. The page counts 30 compatible GPUs in total.

GPU	VRAM	Bandwidth	Vendor	Price
NVIDIA GeForce GTX 1060 6 GB	6 GB	192 GB/s	NVIDIA	
NVIDIA GeForce GTX 1060 6 GB 9Gbps	6 GB	217 GB/s	NVIDIA	
NVIDIA GeForce GTX 1060 6 GB GDDR5X	6 GB	192 GB/s	NVIDIA	
NVIDIA GeForce GTX 1060 6 GB GP104	6 GB	192 GB/s	NVIDIA	
NVIDIA GeForce GTX 1060 6 GB Rev. 2	6 GB	192 GB/s	NVIDIA	
NVIDIA GeForce GTX 1660	6 GB	192 GB/s	NVIDIA	
NVIDIA GeForce GTX 1660 SUPER	6 GB	336 GB/s	NVIDIA	
NVIDIA GeForce GTX 1660 Ti	6 GB	288 GB/s	NVIDIA	
NVIDIA GeForce RTX 2060	6 GB	336 GB/s	NVIDIA	$140
NVIDIA GeForce RTX 2060 TU104	6 GB	336 GB/s	NVIDIA	$140
AMD Radeon RX 5600 OEM	6 GB	288 GB/s	AMD	
AMD Radeon RX 5600 XT				
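
The VRAM and bandwidth figures above can be combined into a quick fit check and a rough ceiling on generation speed. The Python sketch below is a rule-of-thumb illustration, not the page's own estimator: it assumes token generation is limited by memory bandwidth, that each generated token reads about as many bytes as the listed VRAM for the chosen quantization, and it ignores compute limits, context length, and driver overhead. Real throughput will usually be lower. The function names are hypothetical.

```python
# (name, VRAM in GB, memory bandwidth in GB/s), copied from a few entries above.
GPUS = [
    ("NVIDIA GeForce GTX 1060 6 GB", 6, 192),
    ("NVIDIA GeForce GTX 1660 Ti", 6, 288),
    ("NVIDIA GeForce RTX 2060", 6, 336),
]

# Listed VRAM per quantization, from the Quantization Options table.
QUANT_VRAM_GB = {
    "Q4_K_M": 5.9,
    "Q5_K_S": 6.7,
    "Q5_K_M": 6.8,
    "Q6_K": 7.8,
    "Q8_0": 9.9,
    "FP16": 18.3,
}


def fits(vram_gb: float, quant: str) -> bool:
    """True if the listed VRAM requirement is within the card's memory."""
    return QUANT_VRAM_GB[quant] <= vram_gb


def decode_ceiling_tok_s(bandwidth_gb_s: float, quant: str) -> float:
    """Bandwidth-bound upper limit on tokens/s (assumes one full read per token)."""
    return bandwidth_gb_s / QUANT_VRAM_GB[quant]


for name, vram, bandwidth in GPUS:
    usable = [q for q in QUANT_VRAM_GB if fits(vram, q)]
    ceiling = decode_ceiling_tok_s(bandwidth, "Q4_K_M")
    print(f"{name}: fits {usable}, Q4_K_M ceiling ~{ceiling:.0f} tok/s")
```

On these assumptions every 6 GB card in the list fits only Q4_K_M, so the bandwidth column is the main differentiator between them.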
