IBM/Dense

granite-4.0-h-micro 3.2B

Name: granite-4.0-h-micro 3.2B
Author: IBM

chat

3.2B

Parameters

128K

Context length

Benchmarks

Quantizations

18K

HF downloads

Architecture

Dense

Released

2025-09-16

Layers

KV Heads

Head Dim

Family

granite

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M	4.89	2.4 GB	good
Q5_K_S	5.57	2.7 GB	good
Q5_K_M	5.7	2.8 GB	good
Q6_K	6.56	3.1 GB	excellent
Q8_0	8.5	3.9 GB	lossless
FP16	16	6.9 GB	lossless

Select your GPU above to see speed estimates and compatibility for each quantization.

Benchmarks (6)

IFEval51.4

BBH21.7

MMLU-PRO20.2

MATH9.2

GPQA6.6

MUSR1.3

Run this model

Easiest way to get starteddocs →

curl -fsSL https://ollama.com/install.sh | sh

$ollama run granite:3b-q4_k_m

Downloads and runs automatically. Add --verbose for speed stats.

Setup guide

HuggingFace Ollama Library GGUF Downloads Build Hardware

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

NVIDIA Quadro 5000

3 GB VRAM • 120 GB/s

NVIDIA

NVIDIA Quadro 5000 SDI

NVIDIA GeForce GTX 670MX

AMD Radeon HD 7950 Boost

3 GB VRAM • 240 GB/s

AMD

AMD Radeon HD 7950 Monica BIOS 1

3 GB VRAM • 240 GB/s

AMD

AMD Radeon HD 7950 Monica BIOS 2

AMD Radeon HD 7970 GHz Edition

3 GB VRAM • 288 GB/s

AMD

AMD Radeon HD 7970 X2

3 GB VRAM • 264 GB/s

AMD

NVIDIA GeForce GTX 770M

3 GB VRAM • 96 GB/s

NVIDIA

NVIDIA GeForce GTX 780

3 GB VRAM • 288 GB/s

NVIDIA

NVIDIA GeForce GTX 780 Rev. 2

3 GB VRAM • 288 GB/s

NVIDIA

NVIDIA GeForce GTX 780 Ti

3 GB VRAM • 337 GB/s

NVIDIA

AMD Radeon HD 7950 Mac Edition

AMD Radeon HD 8950 OEM

3 GB VRAM • 240 GB/s

AMD

AMD Radeon HD 8970 OEM

3 GB VRAM • 264 GB/s

AMD

AMD Radeon HD 8990 OEM

3 GB VRAM • 288 GB/s

AMD

NVIDIA GeForce GTX 870M

NVIDIA GeForce GTX 1060 3 GB

3 GB VRAM • 192 GB/s

NVIDIA

NVIDIA GeForce GTX 1060 3 GB GP104

NVIDIA GeForce GTX 1050 3 GB

AMD Radeon RX 5300 OEM

3 GB VRAM • 168 GB/s

AMD

Find the best GPU for granite-4.0-h-micro 3.2B

Build Hardware for granite-4.0-h-micro 3.2B

granite-4.0-h-micro 3.2B

Quantization Options

Benchmarks (6)

Run this model

GPUs that can run this model

granite-4.0-h-micro 3.2B — 3.2B Parameter Dense LLM

Model Specifications

VRAM Requirements

Benchmark Scores

How to Run granite-4.0-h-micro 3.2B

Compatible GPUs (30)