Zephyr 3B

chat
Parameters: 2.79B
Context length: 4K
Benchmarks: 6
Quantizations: 4
HF downloads: 50K
Architecture: Dense
Released: 2024-01-17
Layers: 2
KV Heads: 4
Head Dim: 4
Family: zephyr

Quantizations & VRAM

Q4_K_M (4.5 bpw): 2.0 GB VRAM required, 94% quality
Q6_K (6.5 bpw): 2.7 GB VRAM required, 97% quality
Q8_0 (8 bpw): 3.2 GB VRAM required, 100% quality
FP16 (16 bpw): 6.0 GB VRAM required, 100% quality
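
As a rough cross-check on the VRAM figures above, the weights-only footprint can be estimated as parameters × bits per weight ÷ 8. The sketch below assumes decimal gigabytes (1 GB = 10^9 bytes) and uses the parameter count and bpw values listed on this page; the names `weight_gb` and `LISTED` are illustrative only. The page does not say how its VRAM numbers are derived, so the gap this prints (about 0.4 GB for every quantization) is an observation, possibly reflecting KV cache and runtime overhead, not a documented formula.

```python
PARAMS = 2.79e9  # parameter count as listed on this page

# quantization name -> (bits per weight, VRAM in GB as listed on this page)
LISTED = {
    "Q4_K_M": (4.5, 2.0),
    "Q6_K": (6.5, 2.7),
    "Q8_0": (8.0, 3.2),
    "FP16": (16.0, 6.0),
}


def weight_gb(params: float, bpw: float) -> float:
    """Weights-only size in decimal gigabytes (assumes 1 GB = 1e9 bytes)."""
    return params * bpw / 8 / 1e9


for name, (bpw, listed_gb) in LISTED.items():
    est = weight_gb(PARAMS, bpw)
    gap = listed_gb - est  # whatever the listed figure adds on top of the weights
    print(f"{name}: weights ~{est:.2f} GB, listed {listed_gb:.1f} GB, gap {gap:.2f} GB")
```

Treat the weights-only number as a lower bound: longer contexts and larger batch sizes need additional memory beyond it.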

Benchmarks (6)

IFEval: 49.0
BBH: 14.8
MUSR: 9.8
MMLU-PRO: 8.5
MATH: 4.3
GPQA: 0.0

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.