Databricks/Dense

Dolly v2 12B

chat
12B
Parameters
2K
Context length
6
Benchmarks
4
Quantizations
60K
HF downloads
Architecture
Dense
Released
2023-04-12
Layers
36
KV Heads
20
Head Dim
80
Family
dolly

Quantizations & VRAM

Q4_K_M4.5 bpw
7.2 GB
VRAM required
94%
Quality
Q6_K6.5 bpw
10.2 GB
VRAM required
97%
Quality
Q8_08 bpw
12.5 GB
VRAM required
100%
Quality
FP1616 bpw
24.5 GB
VRAM required
100%
Quality

Benchmarks (6)

IFEval23.6
BBH6.4
MUSR5.5
MMLU-PRO1.4
MATH1.4
GPQA0.0

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

Find the best GPU for Dolly v2 12B

Build Hardware for Dolly v2 12B