▸ DEVICE UNDER TEST
Apple M3 Pro (18GB) — 18 GB VRAM.
▸ APPLE M3 PRO (18GB) SPEC
- BRAND
- Apple
- VRAM
- 18 GB Unified
- BANDWIDTH
- 150 GB/s
- FP16 COMPUTE
- 7.4 TFLOPS
- FP32 COMPUTE
- 7.4 TFLOPS
- TDP
- 30 W
- ARCHITECTURE
- M3 Pro
- MSRP
- $1599
▸ AI CAPABILITY
206/ 331 models @ Q4
With 18 GB VRAM and 150 GB/s bandwidth, this GPU handles models up to 24B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~17 tok/s.
§ 01TOP MODELS FOR APPLE M3 PRO (18GB)
206 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Mistral-Small-24B | 24B | 15.2 GB | 6 | 25.0 |
| Mistral-Small-3.1-24B | 24B | 15.2 GB | 6 | 28.8 |
| Magistral Small 24B | 24B | 15.2 GB | 6 | 47.0 |
| Devstral Small 2 24B | 24B | 15.2 GB | 6 | 33.4 |
| Codestral 22B | 22.2B | 14.1 GB | 6 | 50.1 |
| Devstral Small 22B | 22.2B | 14.1 GB | 6 | 35.5 |
| Mistral Small 22B | 22.2B | 14.1 GB | 6 | 35.2 |
| SOLAR-Pro 22B | 22.1B | 14.0 GB | 6 | 44.2 |
| ERNIE 4.5 21B A3B | 21B | 13.3 GB | 44 | — |
| GPT-OSS 20B | 21B | 13.3 GB | 37 | 52.9 |
| InternLM2 20B | 19.8B | 12.6 GB | 7 | 45.1 |
| InternLM2.5 20B | 19.8B | 12.6 GB | 7 | 50.9 |
| Ling-lite 16.8B | 16.8B | 10.8 GB | 56 | — |
| DeepSeek V2 Lite 16B | 16B | 10.3 GB | 56 | 38.0 |
| DeepSeek-Coder-V2-Lite 15.7B | 15.7B | 10.1 GB | 56 | 43.0 |
| DeepSeek-VL2 Small 16B | 15.7B | 10.1 GB | 56 | 43.1 |
| StarCoder 15B | 15.5B | 10.0 GB | 9 | 21.0 |
| StarCoder2 15B | 15B | 9.7 GB | 9 | 26.5 |
| DeepSeek R1 Distill Qwen 14B | 14.8B | 9.5 GB | 9 | 43.9 |
| DeepCoder 14B | 14.8B | 9.5 GB | 9 | 38.7 |