▸ DEVICE UNDER TEST
Apple M3 Max (64GB) — 64 GB VRAM.
▸ APPLE M3 MAX (64GB) SPEC
- BRAND: Apple
- VRAM: 64 GB LPDDR5
- BANDWIDTH: 300 GB/s
- FP16 COMPUTE: 14.2 TFLOPS
- FP32 COMPUTE: 14.2 TFLOPS
- TDP: 60 W
- ARCHITECTURE: M3 Max
- MSRP: $2799
▸ AI CAPABILITY
281 / 331 models @ Q4
With 64 GB of VRAM and 300 GB/s of memory bandwidth, this GPU handles models of up to 90B parameters at Q4.
Speed ≈ (bandwidth / model_size) × efficiency, with bandwidth in GB/s and model_size as the quantized model size in GB. A 7B model at Q4 runs at ~34 tok/s.
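The speed rule above can be sketched in a few lines of Python. This is a rough illustration only: the function names are hypothetical, the fit check (model size no larger than VRAM) is a simplifying assumption, and the efficiency value of 0.5 is a placeholder, since the page does not state the factor it uses. The model size is the VRAM Q4 figure listed for Llama 3.3 70B.

```python
def estimate_tok_per_s(bandwidth_gb_s: float, model_size_gb: float, efficiency: float) -> float:
    """Speed ≈ (bandwidth / model_size) × efficiency, in tokens per second."""
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb * efficiency


def fits_in_vram(model_size_gb: float, vram_gb: float) -> bool:
    """Simplified fit check (assumption): the quantized model must not exceed VRAM."""
    return model_size_gb <= vram_gb


BANDWIDTH_GB_S = 300.0      # from the spec list
VRAM_GB = 64.0              # from the spec list
ASSUMED_EFFICIENCY = 0.5    # placeholder assumption, not a value given by the page

llama_70b_q4_gb = 43.6      # VRAM Q4 figure listed for Llama 3.3 70B

print("fits:", fits_in_vram(llama_70b_q4_gb, VRAM_GB))
print("tok/s:", round(estimate_tok_per_s(BANDWIDTH_GB_S, llama_70b_q4_gb, ASSUMED_EFFICIENCY), 1))
```

Because the estimate scales inversely with model size, halving the quantized size roughly doubles the predicted tokens per second at the same efficiency.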
§ 01 TOP MODELS FOR APPLE M3 MAX (64GB)
281 FIT · SHOWING 20

| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Llama-3.2-90B-Vision-Instruct | 90B | 55.5 GB | 3 | 48.5 |
| Hunyuan A13B | 80B | 49.4 GB | 21 | 81.1 |
| Qwen3-Coder-Next | 80B | 49.4 GB | 89 | 43.0 |
| Qwen2.5-72B | 72.7B | 44.9 GB | 4 | 39.7 |
| Qwen2-VL 72B | 72.7B | 44.9 GB | 4 | 55.5 |
| Qwen 1.5 72B | 72B | 44.5 GB | 4 | 49.7 |
| Qwen2 Math 72B | 72B | 44.5 GB | 4 | 49.7 |
| DeepSeek R1 Distill Llama 70B | 70.6B | 43.6 GB | 4 | 42.4 |
| Llama 3.3 70B | 70.6B | 43.6 GB | 4 | 44.8 |
| Llama 3.1 70B | 70.6B | 43.6 GB | 4 | 33.2 |
| Llama 3 70B | 70.6B | 43.6 GB | 4 | 44.1 |
| Llama-3.1-Nemotron-70B | 70.6B | 43.6 GB | 4 | 43.7 |
| Cogito 70B | 70B | 43.3 GB | 4 | — |
| Llama 2 70B | 70B | 43.3 GB | 4 | 33.4 |
| CodeLlama 70B | 70B | 43.3 GB | 4 | 45.7 |
| Dolphin Llama 3 70B | 70B | 43.3 GB | 4 | 45.7 |
| Tulu 3 70B | 70B | 43.3 GB | 4 | 59.4 |
| WizardLM 70B | 70B | 43.3 GB | 4 | 28.5 |
| OPT 66B | 66B | 40.8 GB | 4 | — |
| LLaMA 1 65B | 65.2B | 40.3 GB | 4 | 42.6 |