▸ DEVICE UNDER TEST
NVIDIA GeForce GTX 580 — 2 GB VRAM.
▸ GEFORCE GTX 580 SPEC
- BRAND: NVIDIA
- VRAM: 2 GB GDDR5
- BANDWIDTH: 192.4 GB/s
- FP16 COMPUTE: 1.6 TFLOPS
- FP32 COMPUTE: 1.6 TFLOPS
- CUDA CORES: 512
- TDP: 244 W
- ARCHITECTURE: Fermi 2.0
▸ AI CAPABILITY
26 / 331 models @ Q4
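With 2 GB of VRAM and 192.4 GB/s of memory bandwidth, this GPU handles models up to 1.3B parameters at Q4.
Speed ≈ (bandwidth / model_size) × efficiency. By this estimate a 7B model at Q4 would run at ~22 tok/s, but at 4 bits per weight it needs about 3.5 GB for the weights alone, so it does not fit in 2 GB of VRAM.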
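The speed estimate can be sketched in a few lines of Python. The page does not state the constants it uses, so both are assumptions here: `bytes_per_param=0.5` (4 bits per weight at Q4) and `efficiency=0.4`. They were chosen because together they reproduce the ~22 tok/s figure for 7B and the TOK/S values listed for the models on this page. The function name is hypothetical.

```python
# Rough throughput estimate: tok/s ≈ (bandwidth / model_size) × efficiency.
# ASSUMPTIONS (not stated by the source): 0.5 bytes per parameter at Q4
# and an efficiency factor of 0.4.

GTX_580_BANDWIDTH_GB_S = 192.4  # memory bandwidth from the spec list


def estimate_tokens_per_second(
    bandwidth_gb_s: float,
    params_billions: float,
    bytes_per_param: float = 0.5,  # assumed: 4-bit weights
    efficiency: float = 0.4,       # assumed: fraction of peak bandwidth achieved
) -> float:
    """Estimate decode speed for a bandwidth-bound model."""
    model_size_gb = params_billions * bytes_per_param
    return bandwidth_gb_s / model_size_gb * efficiency


if __name__ == "__main__":
    for label, params in [("7B", 7.0), ("1.3B", 1.3), ("0.5B", 0.5)]:
        speed = estimate_tokens_per_second(GTX_580_BANDWIDTH_GB_S, params)
        print(f"{label}: ~{speed:.0f} tok/s")
    # 7B: ~22 tok/s
    # 1.3B: ~118 tok/s
    # 0.5B: ~308 tok/s
```

The estimate only models memory traffic for the weights. It says nothing about whether a model fits in VRAM, so a fit check has to be applied separately.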
§ 01 TOP MODELS FOR GEFORCE GTX 580
26 FIT · SHOWING 20

| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| DeepSeek Coder 1.3B | 1.3B | 1.3 GB | 118 | 16.8 |
| EXAONE-4.0-1.2B | 1.3B | 1.3 GB | 118 | 18.9 |
| OPT 1.3B | 1.3B | 1.3 GB | 118 | 5.3 |
| Phi-1 1.3B | 1.3B | 1.3 GB | 118 | 7.2 |
| Phi-1.5 1.3B | 1.3B | 1.3 GB | 118 | 7.2 |
| LFM2.5-1.2B-Thinking | 1.2B | 1.2 GB | 128 | 19.6 |
| Llama-3.2-1B | 1.2B | 1.2 GB | 128 | 10.1 |
| TinyLlama 1.1B | 1.1B | 1.2 GB | 140 | 13.6 |
| Falcon3-1B | 1B | 1.1 GB | 154 | 42.0 |
| InternLM2 1B | 1B | 1.1 GB | 154 | — |
| Qwen3.5-0.8B | 0.9B | 1.0 GB | 171 | 20.5 |
| Qwen 3.5 0.8B | 0.8B | 1.0 GB | 192 | 23.4 |
| GPT-2 Large 774M | 0.774B | 1.0 GB | 199 | 5.6 |
| Qwen3 0.6B | 0.6B | 0.9 GB | 257 | 19.1 |
| BGE-M3 | 0.568B | 0.8 GB | 271 | 63.0 |
| Falcon-H1 0.5B | 0.5B | 0.8 GB | 308 | 41.7 |
| Qwen 1.5 0.5B | 0.5B | 0.8 GB | 308 | 9.9 |
| Qwen 2.5 0.5B | 0.5B | 0.8 GB | 308 | 19.4 |
| SmolLM2 360M | 0.36B | 0.7 GB | 428 | 8.2 |
| GPT-2 Medium 345M | 0.345B | 0.7 GB | 446 | 5.9 |