▸ DEVICE UNDER TEST
NVIDIA RTX 4000 Ada Generation — 20 GB VRAM.
▸ RTX 4000 ADA GENERATION SPEC
- BRAND
- NVIDIA
- VRAM
- 20 GB GDDR6
- BANDWIDTH
- 360 GB/s
- FP16 COMPUTE
- 26.7 TFLOPS
- FP32 COMPUTE
- 26.7 TFLOPS
- CUDA CORES
- 6,144
- TENSOR CORES
- 192
- TDP
- 130 W
- ARCHITECTURE
- Ada Lovelace
- MSRP
- $1250
▸ AI CAPABILITY
212/ 331 models @ Q4
With 20 GB VRAM and 360 GB/s bandwidth, this GPU handles models up to 28B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~41 tok/s.
§ 01TOP MODELS FOR RTX 4000 ADA GENERATION
212 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| PaliGemma 2 28B | 28B | 17.6 GB | 10 | 38.6 |
| Qwen3.5-27B | 27.8B | 17.5 GB | 10 | 59.4 |
| gemma-3-27b | 27.4B | 17.2 GB | 11 | 27.2 |
| gemma-2-27b | 27.2B | 17.1 GB | 11 | 34.6 |
| TranslateGemma 27B | 27B | 17.0 GB | 11 | 38.6 |
| Gemma 4 26B A4B | 26B | 16.4 GB | 72 | 47.9 |
| Mistral-Small-24B | 24B | 15.2 GB | 12 | 25.0 |
| Mistral-Small-3.1-24B | 24B | 15.2 GB | 12 | 28.8 |
| Magistral Small 24B | 24B | 15.2 GB | 12 | 47.0 |
| Devstral Small 2 24B | 24B | 15.2 GB | 12 | 33.4 |
| Codestral 22B | 22.2B | 14.1 GB | 13 | 50.1 |
| Devstral Small 22B | 22.2B | 14.1 GB | 13 | 35.5 |
| Mistral Small 22B | 22.2B | 14.1 GB | 13 | 35.2 |
| SOLAR-Pro 22B | 22.1B | 14.0 GB | 13 | 44.2 |
| ERNIE 4.5 21B A3B | 21B | 13.3 GB | 96 | — |
| GPT-OSS 20B | 21B | 13.3 GB | 80 | 52.9 |
| InternLM2 20B | 19.8B | 12.6 GB | 15 | 45.1 |
| InternLM2.5 20B | 19.8B | 12.6 GB | 15 | 50.9 |
| Ling-lite 16.8B | 16.8B | 10.8 GB | 120 | — |
| DeepSeek V2 Lite 16B | 16B | 10.3 GB | 120 | 38.0 |