NVIDIA GeForce RTX 3080 Ti 20 GB — 20 GB VRAM.
- BRAND
- NVIDIA
- VRAM
- 20 GB GDDR6X
- BANDWIDTH
- 760 GB/s
- FP16 COMPUTE
- 34.1 TFLOPS
- FP32 COMPUTE
- 34.1 TFLOPS
- CUDA CORES
- 10,240
- TENSOR CORES
- 320
- TDP
- 350 W
- ARCHITECTURE
- Ampere
- MSRP
- $1199
With 20 GB VRAM and 760 GB/s bandwidth, this GPU handles models up to 28B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~87 tok/s.
| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| PaliGemma 2 28B | 28B | 17.6 GB | 22 | 38.6 |
| Qwen3.5-27B | 27.8B | 17.5 GB | 22 | 59.4 |
| gemma-3-27b | 27.4B | 17.2 GB | 22 | 27.2 |
| gemma-2-27b | 27.2B | 17.1 GB | 22 | 34.6 |
| TranslateGemma 27B | 27B | 17.0 GB | 23 | 38.6 |
| Gemma 4 26B A4B | 26B | 16.4 GB | 152 | 47.9 |
| Mistral-Small-24B | 24B | 15.2 GB | 25 | 25.0 |
| Mistral-Small-3.1-24B | 24B | 15.2 GB | 25 | 28.8 |
| Magistral Small 24B | 24B | 15.2 GB | 25 | 47.0 |
| Devstral Small 2 24B | 24B | 15.2 GB | 25 | 33.4 |
| Codestral 22B | 22.2B | 14.1 GB | 27 | 50.1 |
| Devstral Small 22B | 22.2B | 14.1 GB | 27 | 35.5 |
| Mistral Small 22B | 22.2B | 14.1 GB | 27 | 35.2 |
| SOLAR-Pro 22B | 22.1B | 14.0 GB | 28 | 44.2 |
| ERNIE 4.5 21B A3B | 21B | 13.3 GB | 203 | — |
| GPT-OSS 20B | 21B | 13.3 GB | 169 | 52.9 |
| InternLM2 20B | 19.8B | 12.6 GB | 31 | 45.1 |
| InternLM2.5 20B | 19.8B | 12.6 GB | 31 | 50.9 |
| Ling-lite 16.8B | 16.8B | 10.8 GB | 253 | — |
| DeepSeek V2 Lite 16B | 16B | 10.3 GB | 253 | 38.0 |