▸ DEVICE UNDER TEST
NVIDIA Tesla K20c — 5 GB VRAM.
▸ TESLA K20C SPEC
- BRAND
- NVIDIA
- VRAM
- 5 GB GDDR5
- BANDWIDTH
- 208 GB/s
- FP16 COMPUTE
- 3.5 TFLOPS
- FP32 COMPUTE
- 3.5 TFLOPS
- CUDA CORES
- 2,496
- TDP
- 225 W
- ARCHITECTURE
- Kepler
▸ AI CAPABILITY
80/ 331 models @ Q4
With 5 GB VRAM and 208 GB/s bandwidth, this GPU handles models up to 6.24B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~24 tok/s.
§ 01TOP MODELS FOR TESLA K20C
80 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| ChatGLM2 6B | 6.24B | 4.3 GB | 27 | 20.7 |
| ChatGLM3 6B | 6.24B | 4.3 GB | 27 | 42.7 |
| Yi-1.5 6B | 6.06B | 4.2 GB | 27 | 28.6 |
| Gemma 3n E2B | 6B | 4.2 GB | 28 | 21.7 |
| Yi 6B | 6B | 4.2 GB | 28 | 23.2 |
| Gemma 4 E2B | 5.1B | 3.6 GB | 33 | 25.7 |
| Qwen3.5-4B | 4.7B | 3.4 GB | 35 | 29.3 |
| InternLM2 5B | 4.5B | 3.2 GB | 37 | 47.6 |
| Gemma 3 4B | 4.3B | 3.1 GB | 39 | 22.8 |
| TranslateGemma 4B | 4B | 2.9 GB | 42 | 5.5 |
| MedGemma 1.5 4B | 4B | 2.9 GB | 42 | 5.5 |
| Qwen 1.5 4B | 4B | 2.9 GB | 42 | 12.6 |
| Qwen3 4B | 4B | 2.9 GB | 42 | 40.7 |
| Qwen 3.5 4B | 4B | 2.9 GB | 42 | 47.4 |
| Nemotron 3 Nano 4B | 3.97B | 2.9 GB | 42 | 32.0 |
| Phi-3.5 Mini 3.8B | 3.82B | 2.8 GB | 44 | 46.6 |
| phi-3-mini-4k 3.8B | 3.8B | 2.8 GB | 44 | 30.5 |
| Phi-4-mini 3.8B | 3.8B | 2.8 GB | 44 | 49.0 |
| Qwen2.5-VL-3B | 3.8B | 2.8 GB | 44 | 29.9 |
| granite-4.0-h-micro 3.2B | 3.2B | 2.4 GB | 52 | 18.4 |