▸ DEVICE UNDER TEST
NVIDIA Quadro 5000 SDI — 3 GB VRAM.
▸ QUADRO 5000 SDI SPEC
- BRAND: NVIDIA
- VRAM: 3 GB GDDR5
- BANDWIDTH: 120 GB/s
- FP16 COMPUTE: 0.7 TFLOPS
- FP32 COMPUTE: 0.7 TFLOPS
- CUDA CORES: 352
- TDP: 172 W
- ARCHITECTURE: Fermi
▸ AI CAPABILITY
52 / 331 models @ Q4
With 3 GB VRAM and 120 GB/s bandwidth, this GPU fits models up to about 2.8B parameters at Q4.
Speed ≈ bandwidth / model_size × efficiency. For reference, a 7B model at Q4 would run at ~14 tok/s by this estimate, but its weights do not fit in 3 GB of VRAM.
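The speed rule of thumb above can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual calculator: the constants `Q4_BYTES_PER_PARAM = 0.5` and `EFFICIENCY = 0.4` are assumptions, chosen only because they reproduce the tok/s figures listed here, and the function name `estimate_tok_per_s` is hypothetical.

```python
# Rough decode-speed estimate: tok/s ≈ bandwidth / model_size × efficiency.
# ASSUMPTIONS (not stated in the source): Q4 weights take about 0.5 bytes
# per parameter, and the efficiency factor is 0.4.

Q4_BYTES_PER_PARAM = 0.5  # assumed bytes per parameter at Q4
EFFICIENCY = 0.4          # assumed fraction of peak bandwidth actually used


def estimate_tok_per_s(params_billions, bandwidth_gb_s=120.0,
                       bytes_per_param=Q4_BYTES_PER_PARAM,
                       efficiency=EFFICIENCY):
    """Return estimated tokens per second for memory-bound decoding."""
    # Weight size in GB: billions of parameters times bytes per parameter.
    model_size_gb = params_billions * bytes_per_param
    return bandwidth_gb_s / model_size_gb * efficiency


if __name__ == "__main__":
    examples = [("7B reference", 7.0), ("Dolly v2 3B", 2.8), ("Granite 3.1 2B", 2.0)]
    for name, params in examples:
        print(f"{name}: ~{estimate_tok_per_s(params):.0f} tok/s")
```

With these assumed constants the estimate comes out at roughly 14 tok/s for a 7B model and 34 tok/s for a 2.8B model, in line with the figures on this page. Real throughput depends on the runtime, context length, and kernel support on this architecture, so treat the result as an order-of-magnitude guide.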
§ 01 TOP MODELS FOR QUADRO 5000 SDI
52 FIT · SHOWING 20

| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Dolly v2 3B | 2.8B | 2.2 GB | 34 | 5.6 |
| StableLM Zephyr 3B | 2.79B | 2.2 GB | 34 | 14.9 |
| Zephyr 3B | 2.79B | 2.2 GB | 34 | 14.4 |
| OPT 2.7B | 2.7B | 2.1 GB | 36 | 28.0 |
| Phi-2 2.7B | 2.7B | 2.1 GB | 36 | 24.1 |
| gemma-2-2b | 2.6B | 2.1 GB | 37 | 22.9 |
| CodeGemma 2B | 2.51B | 2.0 GB | 38 | 22.9 |
| EXAONE Deep 2.4B | 2.4B | 2.0 GB | 40 | 27.1 |
| Qwen3.5-2B | 2.3B | 1.9 GB | 42 | 19.3 |
| Qwen2-VL 2B | 2.21B | 1.8 GB | 43 | 28.3 |
| Gemma 1 2B | 2B | 1.7 GB | 48 | 20.2 |
| Granite 3.0 2B | 2B | 1.7 GB | 48 | 35.8 |
| Granite 3.1 2B | 2B | 1.7 GB | 48 | 37.8 |
| Granite 3.3 2B | 2B | 1.7 GB | 48 | 20.5 |
| Qwen 3.5 2B | 2B | 1.7 GB | 48 | 29.7 |
| Moondream2 1.9B | 1.9B | 1.6 GB | 51 | 26.2 |
| Qwen 1.5 1.8B | 1.8B | 1.6 GB | 53 | 19.6 |
| SmolLM2 1.7B | 1.71B | 1.5 GB | 56 | 16.2 |
| Qwen3 1.7B | 1.7B | 1.5 GB | 56 | 30.1 |
| stablelm-2-1_6b | 1.6B | 1.5 GB | 60 | 9.5 |