▸ DEVICE UNDER TEST
NVIDIA Quadro Plex 7000 — 6 GB VRAM.
▸ QUADRO PLEX 7000 SPEC
- BRAND
- NVIDIA
- VRAM
- 6 GB GDDR5
- BANDWIDTH
- 144 GB/s
- FP16 COMPUTE
- 1.2 TFLOPS
- FP32 COMPUTE
- 1.2 TFLOPS
- CUDA CORES
- 512
- TDP
- 600 W
- ARCHITECTURE
- Fermi 2.0
▸ AI CAPABILITY
142/ 331 models @ Q4
With 6 GB VRAM and 144 GB/s bandwidth, this GPU handles models up to 8B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~16 tok/s.
§ 01TOP MODELS FOR QUADRO PLEX 7000
142 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Aya Expanse 8B | 8B | 5.4 GB | 14 | 27.8 |
| Cogito 8B | 8B | 5.4 GB | 14 | 17.2 |
| DeepSeek R1 Distill Llama 8B | 8B | 5.4 GB | 14 | 36.2 |
| Gemma 3n E4B | 8B | 5.4 GB | 14 | 28.8 |
| Granite 3.3 8B | 8B | 5.4 GB | 14 | 24.6 |
| Llama-3.1-8B | 8B | 5.4 GB | 14 | 23.5 |
| Dolphin Llama 3 8B | 8B | 5.4 GB | 14 | 23.8 |
| Llama 3 8B | 8B | 5.4 GB | 14 | 37.3 |
| Tulu 3 8B | 8B | 5.4 GB | 14 | 31.5 |
| Ministral-8B | 8B | 5.4 GB | 14 | 19.3 |
| Nemotron-H 8B | 8B | 5.4 GB | 14 | 78.4 |
| Granite 8B | 8B | 5.4 GB | 14 | 26.1 |
| InternVL2 8B | 8B | 5.4 GB | 14 | 44.6 |
| MiniCPM-V 2.6 8B | 8B | 5.4 GB | 14 | 40.8 |
| RNJ-1 8B | 8B | 5.4 GB | 14 | 53.5 |
| Gemma 4 E4B | 8B | 5.4 GB | 14 | 32.1 |
| EXAONE Deep 7.8B | 7.8B | 5.3 GB | 15 | 41.4 |
| InternLM2.5 7B | 7.74B | 5.2 GB | 15 | 44.8 |
| Qwen2.5-7B | 7.6B | 5.1 GB | 15 | 35.2 |
| Qwen2.5-Coder-7B | 7.6B | 5.1 GB | 15 | 31.0 |