▸ DEVICE UNDER TEST
NVIDIA RTX 2070 — 8 GB VRAM.
▸ RTX 2070 SPEC
- BRAND
- NVIDIA
- VRAM
- 8 GB GDDR6
- BANDWIDTH
- 448 GB/s
- FP16 COMPUTE
- 14.9 TFLOPS
- FP32 COMPUTE
- 7.5 TFLOPS
- CUDA CORES
- 2,304
- TENSOR CORES
- 288
- TDP
- 175 W
- ARCHITECTURE
- Turing
- MSRP
- $200
▸ AI CAPABILITY
158/ 331 models @ Q4
With 8 GB VRAM and 448 GB/s bandwidth, this GPU handles models up to 10.7B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~51 tok/s.
§ 01TOP MODELS FOR RTX 2070
158 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| SOLAR-10.7B | 10.7B | 7.0 GB | 33 | 28.2 |
| Falcon3-10B | 10.3B | 6.8 GB | 35 | 38.2 |
| Qwen3.5-9B | 9.7B | 6.4 GB | 37 | 38.3 |
| glm-4-9b | 9.4B | 6.2 GB | 38 | 20.5 |
| gemma-2-9b | 9.2B | 6.1 GB | 39 | 30.2 |
| RecurrentGemma 9B | 9B | 6.0 GB | 40 | 35.0 |
| Qwen 3.5 9B | 9B | 6.0 GB | 40 | 50.6 |
| Yi 1.5 9B | 9B | 6.0 GB | 40 | 30.3 |
| Yi Coder 9B | 9B | 6.0 GB | 40 | 35.8 |
| NVIDIA-Nemotron-Nano-9B-v2 | 8.9B | 5.9 GB | 40 | 44.2 |
| CodeGemma 7B | 8.54B | 5.7 GB | 42 | 40.2 |
| Qwen2.5-VL-7B | 8.3B | 5.6 GB | 43 | 39.1 |
| Qwen2-VL 7B | 8.29B | 5.6 GB | 43 | 46.6 |
| Qwen3-8B | 8.2B | 5.5 GB | 44 | 43.3 |
| Granite 3.0 8B | 8.17B | 5.5 GB | 44 | 36.4 |
| Granite 3.1 8B | 8.17B | 5.5 GB | 44 | 38.6 |
| Aya Expanse 8B | 8B | 5.4 GB | 45 | 27.8 |
| Cogito 8B | 8B | 5.4 GB | 45 | 17.2 |
| DeepSeek R1 Distill Llama 8B | 8B | 5.4 GB | 45 | 36.2 |
| Gemma 3n E4B | 8B | 5.4 GB | 45 | 28.8 |