▸ SPEC SHEET
Mistral-Small-24B — 24B Dense.
▸ SPECIFICATIONS
- PARAMETERS
- 24B
- ARCHITECTURE
- Dense Transformer
- CONTEXT LENGTH
- 32K tokens
- CAPABILITIES
- chat, coding, reasoning, tool_use
- RELEASE DATE
- 2025-01-30
- PROVIDER
- Mistral AI
- FAMILY
- mistral
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q3_K_M | 4 | 12.5 GB | 88% |
| Q3_K_L | 4.3 | 13.4 GB | 90% |
| IQ4_XS | 4.46 | 13.9 GB | 92% |
| Q4_K_S | 4.67 | 14.5 GB | 93% |
| Q4_K_M | 4.89 | 15.2 GB | 94% |
| Q5_K_S | 5.57 | 17.2 GB | 96% |
| Q5_K_M | 5.7 | 17.6 GB | 96% |
| Q6_K | 6.56 | 20.2 GB | 97% |
| Q8_0 | 8.5 | 26.0 GB | 100% |
| FP16 | 16 | 48.5 GB | 100% |
§ 01BENCHMARK SCORES
MMLU-PRO34.4
MATH20.4
IFEval62.8
BBH40.6
GPQA11.1
MUSR10.2
BigCodeBench36.1
Arena Elo1233.0
LiveCodeBench25.2
AIME4.3
MATH-5004.3
GPQA Diamond46.2
HLE4.1
§ 02RUN COMMAND
Run Mistral-Small-24B locally with Ollama — needs 15.2 GB VRAM at Q4_K_M:
$
ollama run mistral-small:24b§ 03COMPATIBLE GPUs
30 @ Q4_K_MNVIDIA RTX 5080
16 GB · 960 GB/s
NVIDIA RTX 5070 Ti
16 GB · 896 GB/s
NVIDIA RTX 4080 SUPER
16 GB · 736 GB/s
NVIDIA RTX 4080
16 GB · 717 GB/s
NVIDIA RTX 4070 Ti SUPER
16 GB · 672 GB/s
NVIDIA RTX 4060 Ti 16GB
16 GB · 288 GB/s
AMD RX 7900 GRE
16 GB · 576 GB/s
AMD RX 7800 XT
16 GB · 624 GB/s
AMD RX 7600 XT
16 GB · 288 GB/s
AMD RX 6950 XT
16 GB · 576 GB/s
AMD RX 6900 XT
16 GB · 512 GB/s
AMD RX 6800 XT
16 GB · 512 GB/s
AMD RX 6800
16 GB · 512 GB/s
Intel Arc A770 16GB
16 GB · 560 GB/s
Apple M1 Pro (16GB)
16 GB · 200 GB/s