AMD Radeon RX 6700 XT — 12 GB VRAM.
- BRAND
- AMD
- VRAM
- 12 GB GDDR6
- BANDWIDTH
- 384 GB/s
- FP16 COMPUTE
- 26.4 TFLOPS
- FP32 COMPUTE
- 13.2 TFLOPS
- STREAM PROCESSORS
- 2,560
- TDP
- 230 W
- ARCHITECTURE
- RDNA 2.0
- MSRP
- $250
With 12 GB VRAM and 384 GB/s bandwidth, this GPU handles models up to 16.8B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~44 tok/s.
| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Ling-lite 16.8B | 16.8B | 10.8 GB | 142 | — |
| DeepSeek V2 Lite 16B | 16B | 10.3 GB | 142 | 38.0 |
| DeepSeek-Coder-V2-Lite 15.7B | 15.7B | 10.1 GB | 142 | 43.0 |
| DeepSeek-VL2 Small 16B | 15.7B | 10.1 GB | 142 | 43.1 |
| StarCoder 15B | 15.5B | 10.0 GB | 22 | 21.0 |
| StarCoder2 15B | 15B | 9.7 GB | 23 | 26.5 |
| DeepSeek R1 Distill Qwen 14B | 14.8B | 9.5 GB | 23 | 43.9 |
| DeepCoder 14B | 14.8B | 9.5 GB | 23 | 38.7 |
| Qwen2.5-Coder-14B | 14.8B | 9.5 GB | 23 | 41.3 |
| Qwen2.5-14B | 14.8B | 9.5 GB | 23 | 41.3 |
| Qwen3 14B | 14.8B | 9.5 GB | 23 | 45.7 |
| Ministral 3 14B | 14B | 9.0 GB | 24 | 25.9 |
| Phi-3-medium-14b | 14B | 9.0 GB | 24 | 33.7 |
| phi-4 14B | 14B | 9.0 GB | 24 | 33.7 |
| Phi-4-reasoning 14B | 14B | 9.0 GB | 24 | 33.7 |
| Phi-4-multimodal 14B | 14B | 9.0 GB | 24 | 42.0 |
| Qwen 1.5 14B | 14B | 9.0 GB | 24 | 41.3 |
| LLaVA-1.5 13B | 13.1B | 8.5 GB | 26 | 52.1 |
| Baichuan2 13B | 13B | 8.4 GB | 26 | 23.6 |
| Llama 2 13B | 13B | 8.4 GB | 26 | 19.7 |