▸ DEVICE UNDER TEST
NVIDIA CMP 50HX — 10 GB VRAM.
▸ CMP 50HX SPEC
- BRAND: NVIDIA
- VRAM: 10 GB GDDR6
- BANDWIDTH: 560 GB/s
- FP16 COMPUTE: 22.1 TFLOPS
- FP32 COMPUTE: 11.1 TFLOPS
- CUDA CORES: 3,584
- TENSOR CORES: 448
- TDP: 250 W
- ARCHITECTURE: Turing
▸ AI CAPABILITY
177 / 331 models @ Q4
With 10 GB VRAM and 560 GB/s bandwidth, this GPU handles models up to 13.1B parameters at Q4.
Speed ≈ (bandwidth / model_size) × efficiency, where model_size is the memory footprint of the quantized model. A 7B model at Q4 runs at ~64 tok/s.
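
As a rough illustration of the speed formula, the sketch below estimates decode throughput from memory bandwidth and model footprint. The page does not state its constants, so both are assumptions back-fitted to the figures listed here: `GB_PER_BILLION_PARAMS_Q4 = 0.65` (Q4 footprint per billion parameters) and `EFFICIENCY = 0.52`. The function names are hypothetical.

```python
# Minimal sketch of: speed ≈ (bandwidth / model_size) × efficiency.
# ASSUMPTION: both constants are back-fitted to this page's numbers,
# not published values.
GB_PER_BILLION_PARAMS_Q4 = 0.65  # assumed Q4 footprint per 1B parameters
EFFICIENCY = 0.52                # assumed fraction of peak bandwidth achieved


def q4_footprint_gb(params_billion: float) -> float:
    """Approximate VRAM needed for a Q4 model of the given size."""
    return params_billion * GB_PER_BILLION_PARAMS_Q4


def estimate_tok_per_s(bandwidth_gb_s: float, params_billion: float) -> float:
    """Estimate tokens/s, assuming decoding is memory-bandwidth bound."""
    model_size_gb = q4_footprint_gb(params_billion)
    return bandwidth_gb_s / model_size_gb * EFFICIENCY


if __name__ == "__main__":
    bandwidth = 560.0  # GB/s, CMP 50HX
    for size in (7.0, 13.0):
        tok_s = estimate_tok_per_s(bandwidth, size)
        print(f"{size:g}B @ Q4: ~{tok_s:.0f} tok/s")
```

With these assumed constants the estimate comes out at roughly 64 tok/s for a 7B model and 34 tok/s for a 13B model, in line with the figures on this page. Real throughput depends on the runtime, context length, and quantization variant.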
§ 01 TOP MODELS FOR CMP 50HX
177 FIT · SHOWING 20

| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| LLaVA-1.5 13B | 13.1B | 8.5 GB | 34 | 52.1 |
| Baichuan2 13B | 13B | 8.4 GB | 34 | 23.6 |
| Llama 2 13B | 13B | 8.4 GB | 34 | 19.7 |
| CodeLlama 13B | 13B | 8.4 GB | 34 | 19.7 |
| Vicuna 13B | 13B | 8.4 GB | 34 | 11.8 |
| LLaMA 1 13B | 13B | 8.4 GB | 34 | 32.9 |
| OPT 13B | 13B | 8.4 GB | 34 | 35.8 |
| Orca 2 13B | 13B | 8.4 GB | 34 | 25.4 |
| LLaVA-1.6 Vicuna 13B | 13B | 8.4 GB | 34 | 37.4 |
| WizardCoder Python 13B | 13B | 8.4 GB | 34 | 60.1 |
| WizardLM 13B | 13B | 8.4 GB | 34 | 19.5 |
| Mistral-Nemo 12.2B | 12.2B | 7.9 GB | 37 | 22.4 |
| Dolly v2 12B | 12B | 7.8 GB | 37 | 6.4 |
| gemma-3-12b | 12B | 7.8 GB | 37 | 26.6 |
| TranslateGemma 12B | 12B | 7.8 GB | 37 | 35.1 |
| Pixtral 12B | 12B | 7.8 GB | 37 | 52.1 |
| StableLM 2 12B | 12B | 7.8 GB | 37 | 21.3 |
| Falcon2 11B | 11B | 7.2 GB | 41 | 33.2 |
| Llama-3.2-11B-Vision-Instruct | 11B | 7.2 GB | 41 | 34.4 |
| SOLAR-10.7B | 10.7B | 7.0 GB | 42 | 28.2 |