Granite 3.3 8B

Name: Granite 3.3 8B
Author: granite

Tags: chat, reasoning, coding, tool_use

Parameters: 8B
Context length: 125K
Benchmarks: 8
Quantizations: 6
Architecture: Dense
Released: 2025-04-16
Layers:
KV Heads:
Head Dim: 128
Family: granite

Quantization Options

Quant	Bits per weight	VRAM	Quality
Q4_K_M	4.89	5.4 GB	good
Q5_K_S	5.57	6.1 GB	good
Q5_K_M	5.7	6.2 GB	good
Q6_K	6.56	7.0 GB	excellent
Q8_0	8.5	9.0 GB	lossless
FP16	16	16.5 GB	lossless
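
The VRAM column can be sanity-checked with a rough weights-only estimate: parameter count times bits per weight, divided by 8 to get bytes. The sketch below assumes exactly 8e9 parameters (the page only says "8B", so the true count may differ) and treats 1 GB as 1e9 bytes. The names `PARAMS`, `QUANTS`, and `weights_gb` are illustrative, not part of any tool.

```python
# Rough weights-only size estimate: parameters x bits per weight / 8 bytes.
# ASSUMPTION: 8e9 parameters; the page only states "8B".
PARAMS = 8e9

# (bits per weight, listed VRAM in GB), copied from the table above.
QUANTS = {
    "Q4_K_M": (4.89, 5.4),
    "Q5_K_S": (5.57, 6.1),
    "Q5_K_M": (5.7, 6.2),
    "Q6_K": (6.56, 7.0),
    "Q8_0": (8.5, 9.0),
    "FP16": (16, 16.5),
}


def weights_gb(bits_per_weight: float, params: float = PARAMS) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


for name, (bits, listed) in QUANTS.items():
    est = weights_gb(bits)
    print(f"{name}: weights ~{est:.2f} GB, listed {listed} GB, "
          f"difference {listed - est:.2f} GB")
```

Under these assumptions the listed figure exceeds the weights-only estimate by roughly half a gigabyte in every row. That gap presumably covers runtime overhead such as the KV cache and buffers, but this interpretation is a guess, not something the page states.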

Benchmarks (8)

Benchmark	Score
HumanEval	89.7
IFEval	74.8
MATH-500	69.0
BBH	34.7
MMLU-PRO	27.9
MATH	23.8
MUSR	16.8
GPQA	8.7

Run this model

Easiest way to get started

curl -fsSL https://ollama.com/install.sh | sh

$ ollama run granite3.3:8b

Downloads and runs automatically. Add --verbose for speed stats.
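
Once the model is running, it can also be queried from a script through Ollama's local HTTP API. The following is a minimal sketch, assuming the server is listening on its default address and that the model was pulled under the tag `granite3.3:8b`; substitute whichever tag you actually pulled. The helper names (`build_payload`, `tokens_per_second`, `generate`) are hypothetical. The speed calculation uses the `eval_count` and `eval_duration` (nanoseconds) fields of the reply.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # default local endpoint
MODEL = "granite3.3:8b"  # ASSUMPTION: replace with the tag you pulled


def build_payload(prompt: str, model: str = MODEL) -> bytes:
    """Encode a non-streaming generate request as JSON bytes."""
    body = {"model": model, "prompt": prompt, "stream": False}
    return json.dumps(body).encode("utf-8")


def tokens_per_second(reply: dict) -> float:
    """Generation speed from the reply's eval_count and eval_duration (ns)."""
    return reply["eval_count"] / (reply["eval_duration"] / 1e9)


def generate(prompt: str) -> dict:
    """Send the prompt to the local server and return the parsed JSON reply."""
    request = urllib.request.Request(
        OLLAMA_URL,
        data=build_payload(prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With the server running, `reply = generate("Say hello")` returns a dictionary whose `response` key holds the generated text, and `tokens_per_second(reply)` gives a speed figure comparable to what `--verbose` prints.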

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

GPU	VRAM	Bandwidth	Vendor	Price
NVIDIA GeForce GTX 1060 6 GB	6 GB	192 GB/s	NVIDIA
NVIDIA GeForce GTX 1060 6 GB 9Gbps	6 GB	217 GB/s	NVIDIA
NVIDIA GeForce GTX 1060 6 GB GDDR5X	6 GB	192 GB/s	NVIDIA
NVIDIA GeForce GTX 1060 6 GB GP104	6 GB	192 GB/s	NVIDIA
NVIDIA GeForce GTX 1060 6 GB Rev. 2	6 GB	192 GB/s	NVIDIA
NVIDIA GeForce GTX 1660	6 GB	192 GB/s	NVIDIA
NVIDIA GeForce GTX 1660 SUPER	6 GB	336 GB/s	NVIDIA
NVIDIA GeForce GTX 1660 Ti	6 GB	288 GB/s	NVIDIA
NVIDIA GeForce RTX 2060	6 GB	336 GB/s	NVIDIA	$140
NVIDIA GeForce RTX 2060 TU104	6 GB	336 GB/s	NVIDIA	$140
AMD Radeon RX 5600 OEM	6 GB	288 GB/s	AMD
AMD Radeon RX 5600 XT
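
The bandwidth figures in this list matter because token generation is usually limited by how fast the GPU can read the model from memory. A common rule of thumb puts the ceiling at memory bandwidth divided by model size, since each generated token reads roughly the whole model once. The sketch below applies that heuristic using the 5.4 GB Q4_K_M figure from the quantization table. It is an optimistic upper bound under that assumption, not a benchmark, and the names `MODEL_GB`, `GPUS`, and `max_tokens_per_s` are illustrative.

```python
# Upper-bound decode speed: memory bandwidth / bytes read per token.
# ASSUMPTION: each token reads about the full model (5.4 GB at Q4_K_M) once.
MODEL_GB = 5.4

# Bandwidth in GB/s for a few cards from the list above.
GPUS = {
    "NVIDIA GeForce GTX 1060 6 GB": 192,
    "NVIDIA GeForce GTX 1660 Ti": 288,
    "NVIDIA GeForce GTX 1660 SUPER": 336,
    "NVIDIA GeForce RTX 2060": 336,
}


def max_tokens_per_s(bandwidth_gb_s: float, model_gb: float = MODEL_GB) -> float:
    """Bandwidth-bound ceiling on tokens per second (real speed is lower)."""
    return bandwidth_gb_s / model_gb


for name, bandwidth in GPUS.items():
    print(f"{name}: at most ~{max_tokens_per_s(bandwidth):.0f} tokens/s")
```

Measured throughput will fall below these ceilings because of compute limits, kernel overhead, and context length, so the numbers are best used to rank cards rather than to predict exact speeds.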

Granite 3.3 8B — 8B Parameter Dense LLM

Model Specifications

VRAM Requirements

Benchmark Scores

How to Run Granite 3.3 8B

Compatible GPUs (30)