▸ DEVICE UNDER TEST
AMD Radeon Pro W6800X — 32 GB VRAM.
▸ RADEON PRO W6800X SPEC
- BRAND
- AMD
- VRAM
- 32 GB GDDR6
- BANDWIDTH
- 512 GB/s
- FP16 COMPUTE
- 32.1 TFLOPS
- FP32 COMPUTE
- 16 TFLOPS
- STREAM PROCESSORS
- 3,840
- TDP
- 200 W
- ARCHITECTURE
- RDNA 2.0
▸ AI CAPABILITY
255/ 331 models @ Q4
With 32 GB VRAM and 512 GB/s bandwidth, this GPU handles models up to 41.9B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~59 tok/s.
§ 01TOP MODELS FOR RADEON PRO W6800X
255 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Phi-3.5 MoE 42B | 41.9B | 26.1 GB | 69 | 56.7 |
| Falcon 40B | 40B | 24.9 GB | 11 | 20.9 |
| Qwen3.5-35B-A3B | 36B | 22.5 GB | 13 | 48.5 |
| c4ai-command-r-v01 35B | 35B | 21.9 GB | 13 | 27.5 |
| Qwen 3.5 35B A3B | 35B | 21.9 GB | 152 | 53.3 |
| Qwen 3.6 35B A3B | 35B | 21.9 GB | 152 | 62.7 |
| Nous Capybara 34B | 34.4B | 21.5 GB | 13 | 42.0 |
| Yi-1.5 34B | 34.4B | 21.5 GB | 13 | 45.3 |
| Falcon-H1 34B | 34B | 21.3 GB | 13 | 66.1 |
| CodeLlama 34B | 34B | 21.3 GB | 13 | 25.4 |
| Nous Hermes 2 34B | 34B | 21.3 GB | 13 | 47.0 |
| Phind CodeLlama 34B | 34B | 21.3 GB | 13 | 68.1 |
| LLaVA-1.6 Yi 34B | 34B | 21.3 GB | 13 | 47.4 |
| WizardCoder Python 34B | 34B | 21.3 GB | 13 | 73.2 |
| Yi 34B | 34B | 21.3 GB | 13 | 33.4 |
| DeepSeek Coder 33B | 33B | 20.7 GB | 14 | 26.0 |
| Vicuna 33B | 33B | 20.7 GB | 14 | 17.2 |
| LLaMA 1 30B | 33B | 20.7 GB | 14 | 17.8 |
| DeepSeek-R1-Distill-Qwen-32B | 32.8B | 20.5 GB | 14 | 46.9 |
| Qwen3 32B | 32.8B | 20.5 GB | 14 | 54.9 |