▸ SPEC SHEET
GPT-2 124M — 0.124B Dense.
▸ SPECIFICATIONS
- PARAMETERS
- 0.124B
- ARCHITECTURE
- Dense Transformer
- CONTEXT LENGTH
- 1K tokens
- CAPABILITIES
- chat
- RELEASE DATE
- 2019-02-14
- PROVIDER
- OpenAI
- FAMILY
- gpt2
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q4_K_M | 4.89 | 0.6 GB | 94% |
| Q5_K_S | 5.57 | 0.6 GB | 96% |
| Q5_K_M | 5.7 | 0.6 GB | 96% |
| Q6_K | 6.56 | 0.6 GB | 97% |
| Q8_0 | 8.5 | 0.6 GB | 100% |
| FP16 | 16 | 0.7 GB | 100% |
§ 01BENCHMARK SCORES
MMLU-PRO1.8
MATH0.2
IFEval17.9
BBH2.7
GPQA1.1
MUSR15.3
§ 03COMPATIBLE GPUs
30 @ Q4_K_MAMD FireGL V7350
1 GB · 41.6 GB/s
NVIDIA Quadro FX 5500
1 GB · 32.3 GB/s
NVIDIA Quadro FX 5500 SDI
1 GB · 32.3 GB/s
AMD Stream Processor
1 GB · 41.5 GB/s
AMD FireGL V8600
1 GB · 111.1 GB/s
NVIDIA GeForce 9600 GT Mac Edition
1 GB · 17 GB/s
NVIDIA GeForce 9600M GS
1 GB · 25.6 GB/s
NVIDIA GeForce 9650M GT
1 GB · 25.6 GB/s
NVIDIA GeForce 9800M GTS
1 GB · 51.2 GB/s
NVIDIA GeForce 9800M GTX
1 GB · 51.2 GB/s
NVIDIA GeForce GTX 280
1 GB · 141.7 GB/s
NVIDIA GeForce GTX 285
1 GB · 159 GB/s
NVIDIA Quadro FX 3700M
1 GB · 51.2 GB/s
NVIDIA Quadro FX 3800M
1 GB · 64 GB/s
NVIDIA Quadro FX 4700 X2
1 GB · 51.2 GB/s