▸ DEVICE UNDER TEST
NVIDIA Quadro P3200 Mobile — 6 GB VRAM.
▸ QUADRO P3200 MOBILE SPEC
- BRAND
- NVIDIA
- VRAM
- 6 GB GDDR5
- BANDWIDTH
- 168.2 GB/s
- FP16 COMPUTE
- 0.1 TFLOPS
- FP32 COMPUTE
- 5.5 TFLOPS
- CUDA CORES
- 1,792
- TDP
- 75 W
- ARCHITECTURE
- Pascal
▸ AI CAPABILITY
194/ 428 models @ Q4
With 6 GB VRAM and 168.2 GB/s bandwidth, this GPU handles models up to 8B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~19 tok/s.
§ 01TOP MODELS FOR QUADRO P3200 MOBILE
194 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Aya Expanse 8B | 8B | 5.4 GB | 17 | 27.8 |
| Cogito 8B | 8B | 5.4 GB | 17 | 17.2 |
| DeepSeek R1 Distill Llama 8B | 8B | 5.4 GB | 17 | 36.2 |
| Gemma 3n E4B | 8B | 5.4 GB | 17 | 28.8 |
| Granite 3.3 8B | 8B | 5.4 GB | 17 | 24.6 |
| Llama-3.1-8B | 8B | 5.4 GB | 17 | 23.5 |
| Dolphin Llama 3 8B | 8B | 5.4 GB | 17 | 23.8 |
| Llama 3 8B | 8B | 5.4 GB | 17 | 37.3 |
| Tulu 3 8B | 8B | 5.4 GB | 17 | 31.5 |
| Ministral-8B | 8B | 5.4 GB | 17 | 19.3 |
| Nemotron-H 8B | 8B | 5.4 GB | 17 | 78.4 |
| Granite 8B | 8B | 5.4 GB | 17 | 26.1 |
| InternVL2 8B | 8B | 5.4 GB | 17 | 44.6 |
| MiniCPM-V 2.6 8B | 8B | 5.4 GB | 17 | 40.8 |
| RNJ-1 8B | 8B | 5.4 GB | 17 | 53.5 |
| Gemma 4 E4B | 8B | 5.4 GB | 17 | 32.1 |
| InternLM3 8B Instruct | 8B | 5.4 GB | 17 | 38.7 |
| Qwen3-VL 8B Instruct | 8B | 5.4 GB | 17 | 26.4 |
| Qwen3-Embedding 8B | 8B | 5.4 GB | 17 | — |
| Granite 4.1 8B | 8B | 5.4 GB | 17 | 25.1 |