Granite 3.3 8B

Family: granite
Architecture: Dense
Capabilities: chat, reasoning, coding, tool_use
Parameters: 8B
Context length: 125K
Benchmarks: 8
Quantizations: 6
Released: 2025-04-16
Layers: 32
KV Heads: 8
Head Dim: 128
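
The Layers, KV Heads, and Head Dim values above are enough for a rough estimate of how much memory the KV cache adds per token of context, on top of the model weights. The sketch below assumes keys and values are cached in FP16 (2 bytes per element); that precision is an assumption, not something this page states, and the function names are illustrative only.

```python
# Rough KV-cache size from the spec values listed on this page.
# Assumption: keys and values are cached in FP16 (2 bytes per element).

LAYERS = 32
KV_HEADS = 8
HEAD_DIM = 128
BYTES_PER_ELEMENT = 2  # FP16, assumed


def kv_cache_bytes_per_token(layers=LAYERS, kv_heads=KV_HEADS,
                             head_dim=HEAD_DIM, elem_bytes=BYTES_PER_ELEMENT):
    """Bytes of cache added per token: one K and one V vector per layer."""
    return 2 * layers * kv_heads * head_dim * elem_bytes


def kv_cache_gib(context_tokens, **kwargs):
    """Total cache size in GiB for a given number of context tokens."""
    return context_tokens * kv_cache_bytes_per_token(**kwargs) / 1024**3


if __name__ == "__main__":
    print(kv_cache_bytes_per_token())  # 131072 bytes, i.e. 128 KiB per token
    for ctx in (4096, 32768):
        print(ctx, "tokens:", kv_cache_gib(ctx), "GiB")
```

Under these assumptions a 4,096-token context adds about 0.5 GiB and a 32,768-token context about 4 GiB, so long contexts can need noticeably more memory than the weights alone.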

Quantization Options

Quant     Bits    VRAM      Quality
Q4_K_M    4.89    5.4 GB    good
Q5_K_S    5.57    6.1 GB    good
Q5_K_M    5.7     6.2 GB    good
Q6_K      6.56    7.0 GB    excellent
Q8_0      8.5     9.0 GB    lossless
FP16      16      16.5 GB   lossless


Benchmarks (8)

HumanEval   89.7
IFEval      74.8
MATH-500    69.0
BBH         34.7
MMLU-PRO    27.9
MATH        23.8
MUSR        16.8
GPQA        8.7

Run this model

Easiest way to get started:

curl -fsSL https://ollama.com/install.sh | sh
ollama run granite3.3:8b

The model downloads on first run and starts automatically. Add --verbose for speed stats.


GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

GPU                                    VRAM    Bandwidth   Vendor   Price
NVIDIA RTX 3050 6GB                    6 GB    168 GB/s    NVIDIA   $169
Intel Arc A380                         6 GB    186 GB/s    Intel    $129
NVIDIA RTX 2060 6GB                    6 GB    336 GB/s    NVIDIA   $150
NVIDIA GTX 1660 SUPER                  6 GB    336 GB/s    NVIDIA   $150
NVIDIA GTX 1660 Ti                     6 GB    288 GB/s    NVIDIA   $140
NVIDIA GTX 1060 6GB                    6 GB    192 GB/s    NVIDIA   $80
NVIDIA Tesla C2070                     6 GB    143 GB/s    NVIDIA   -
NVIDIA Tesla C2075                     6 GB    150 GB/s    NVIDIA   -
NVIDIA Tesla C2090                     6 GB    177 GB/s    NVIDIA   -
NVIDIA Tesla M2070                     6 GB    150 GB/s    NVIDIA   -
NVIDIA Tesla M2070-Q                   6 GB    150 GB/s    NVIDIA   -
NVIDIA Tesla M2075                     6 GB    150 GB/s    NVIDIA   -
NVIDIA Tesla M2090                     6 GB    177 GB/s    NVIDIA   -
NVIDIA Tesla X2070                     6 GB    177 GB/s    NVIDIA   -
NVIDIA Tesla X2090                     6 GB    177 GB/s    NVIDIA   -
NVIDIA Tesla K20X                      6 GB    250 GB/s    NVIDIA   -
NVIDIA Tesla K20Xm                     6 GB    250 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1060 6 GB           6 GB    192 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1060 6 GB 9Gbps     6 GB    217 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1060 6 GB GDDR5X    6 GB    192 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1060 6 GB GP104     6 GB    192 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1060 6 GB Rev. 2    6 GB    192 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1660                6 GB    192 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1660 SUPER          6 GB    336 GB/s    NVIDIA   -
NVIDIA GeForce GTX 1660 Ti             6 GB    288 GB/s    NVIDIA   -
NVIDIA GeForce RTX 2060                6 GB    336 GB/s    NVIDIA   $140
NVIDIA GeForce RTX 2060 TU104          6 GB    336 GB/s    NVIDIA   $140
AMD Radeon RX 5600 OEM                 6 GB    288 GB/s    AMD      -
AMD Radeon RX 5600 XT                  6 GB    288 GB/s    AMD      $90
AMD Radeon RX 5600M                    6 GB    288 GB/s    AMD      -
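
The VRAM and bandwidth figures in this list can be combined into a quick compatibility check and a very rough speed ceiling. The sketch below assumes token generation is limited by memory bandwidth, so that each generated token reads the whole model once; this is a common rule of thumb, not the method this page uses for its own estimates, and real throughput will be lower. The `Gpu` class and function names are hypothetical.

```python
from dataclasses import dataclass

MODEL_GB = 5.4  # Q4_K_M VRAM figure quoted on this page


@dataclass
class Gpu:
    name: str
    vram_gb: float
    bandwidth_gbs: float


# A few entries copied from the list above.
GPUS = [
    Gpu("NVIDIA RTX 3050 6GB", 6, 168),
    Gpu("Intel Arc A380", 6, 186),
    Gpu("NVIDIA RTX 2060 6GB", 6, 336),
    Gpu("AMD Radeon RX 5600 XT", 6, 288),
]


def fits(gpu, model_gb=MODEL_GB):
    """True if the card has at least as much VRAM as the model needs."""
    return gpu.vram_gb >= model_gb


def decode_ceiling_tps(gpu, model_gb=MODEL_GB):
    """Rough upper bound on tokens/s, assuming bandwidth-bound decoding."""
    return gpu.bandwidth_gbs / model_gb


if __name__ == "__main__":
    usable = [g for g in GPUS if fits(g)]
    for gpu in sorted(usable, key=decode_ceiling_tps, reverse=True):
        print(f"{gpu.name}: at most ~{decode_ceiling_tps(gpu):.0f} tokens/s")
```

By this estimate the bandwidth column matters more than the identical 6 GB VRAM figure when choosing between these cards: a 336 GB/s card has twice the ceiling of a 168 GB/s one.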


Granite 3.3 8B: 8B Parameter Dense LLM

Model Specifications

Parameters: 8B
Architecture: Dense Transformer
Context Length: 125K tokens
Capabilities: chat, reasoning, coding, tool_use
Release Date: 2025-04-16
Family: granite

VRAM Requirements

Quantization   BPW     VRAM      Quality
Q4_K_M         4.89    5.4 GB    94%
Q5_K_S         5.57    6.1 GB    96%
Q5_K_M         5.7     6.2 GB    96%
Q6_K           6.56    7.0 GB    97%
Q8_0           8.5     9.0 GB    100%
FP16           16      16.5 GB   100%
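
The VRAM column tracks the bits-per-weight (BPW) column closely: weight size is roughly parameters times BPW divided by 8, plus some fixed overhead. The sketch below checks that against the table. The 0.5 GB overhead and the exact 8.0 billion parameter count are assumptions fitted to these rows, not values this page states.

```python
PARAMS_BILLION = 8.0  # nominal size; assumed exact for this estimate
OVERHEAD_GB = 0.5     # assumed constant overhead, fitted to the table

# (bits per weight, VRAM in GB) as listed in the table above
TABLE = {
    "Q4_K_M": (4.89, 5.4),
    "Q5_K_S": (5.57, 6.1),
    "Q5_K_M": (5.7, 6.2),
    "Q6_K": (6.56, 7.0),
    "Q8_0": (8.5, 9.0),
    "FP16": (16, 16.5),
}


def weight_gb(bpw, params_billion=PARAMS_BILLION):
    """Size of the weights alone: params * bits / 8 bits per byte."""
    return params_billion * bpw / 8


def estimated_vram_gb(bpw, params_billion=PARAMS_BILLION,
                      overhead_gb=OVERHEAD_GB):
    """Weights plus the assumed fixed overhead."""
    return weight_gb(bpw, params_billion) + overhead_gb


if __name__ == "__main__":
    for quant, (bpw, listed) in TABLE.items():
        est = estimated_vram_gb(bpw)
        print(f"{quant}: estimated {est:.2f} GB, listed {listed} GB")
```

With these assumptions every estimate lands within 0.1 GB of the listed figure, which makes the formula a usable first guess for quantizations that are not in the table.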

Benchmark Scores

HumanEval   89.7
MMLU-PRO    27.9
MATH        23.8
IFEval      74.8
BBH         34.7
GPQA        8.7
MUSR        16.8
MATH-500    69.0

How to Run Granite 3.3 8B

Run Granite 3.3 8B locally with Ollama (needs 5.4 GB VRAM at Q4_K_M):

ollama run granite3.3:8b
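
Once the model has been pulled, it can also be called from a script through Ollama's local HTTP API instead of the interactive prompt. The following is a minimal sketch using only the Python standard library; it assumes the Ollama server is running on its default port 11434 on the same machine, and the helper names `build_request` and `generate` are illustrative.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "granite3.3:8b"


def build_request(prompt, model=MODEL, url=OLLAMA_URL):
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, timeout=300):
    """Send the prompt and return the generated text."""
    with urllib.request.urlopen(build_request(prompt), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `generate("Explain KV caching in one sentence.")` returns the model's reply as a string. Setting `"stream"` to false keeps the example simple by returning one JSON object rather than a stream of partial responses.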

Compatible GPUs (30)

GPUs that can run Granite 3.3 8B at Q4_K_M quantization: