▸ DEVICE UNDER TEST
NVIDIA GeForce 9600M GS — 1 GB VRAM.
▸ GEFORCE 9600M GS SPEC
- BRAND
- NVIDIA
- VRAM
- 1 GB GDDR3
- BANDWIDTH
- 25.6 GB/s
- FP16 COMPUTE
- 0.1 TFLOPS
- FP32 COMPUTE
- 0.1 TFLOPS
- CUDA CORES
- 32
- TDP
- 20 W
- ARCHITECTURE
- Tesla
▸ AI CAPABILITY
23/ 428 models @ Q4
With 1 GB VRAM and 25.6 GB/s bandwidth, this GPU handles models up to 0.6B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~3 tok/s.
§ 01TOP MODELS FOR GEFORCE 9600M GS
23 FIT · SHOWING 20| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Qwen3 0.6B | 0.6B | 0.9 GB | 34 | 19.1 |
| Qwen3-Embedding 0.6B | 0.6B | 0.9 GB | 34 | — |
| Falcon-H1R Tiny 0.6B | 0.6B | 0.9 GB | 34 | — |
| Falcon Perception 0.6B | 0.6B | 0.9 GB | 34 | — |
| BGE-M3 | 0.568B | 0.8 GB | 36 | 63.0 |
| Snowflake Arctic Embed L v2.0 | 0.568B | 0.8 GB | 36 | — |
| Falcon-H1 0.5B | 0.5B | 0.8 GB | 41 | 41.7 |
| Qwen 1.5 0.5B | 0.5B | 0.8 GB | 41 | 9.9 |
| Qwen 2.5 0.5B | 0.5B | 0.8 GB | 41 | 19.4 |
| SmolVLM 500M | 0.5B | 0.8 GB | 41 | — |
| SmolLM2 360M | 0.36B | 0.7 GB | 57 | 8.2 |
| LFM2 350M | 0.35B | 0.7 GB | 59 | 46.3 |
| GPT-2 Medium 345M | 0.345B | 0.7 GB | 59 | 5.9 |
| bge-large-en-v1.5 335M | 0.335B | 0.7 GB | 61 | 62.3 |
| mxbai-embed-large-v1 | 0.335B | 0.7 GB | 61 | 64.7 |
| Snowflake Arctic Embed L | 0.335B | 0.7 GB | 61 | 56.0 |
| Snowflake Arctic Embed M v2.0 | 0.305B | 0.7 GB | 67 | — |
| Gemma 3 270M | 0.27B | 0.7 GB | 76 | 12.6 |
| SmolVLM 256M | 0.256B | 0.6 GB | 80 | 28.3 |
| SmolLM2 135M | 0.135B | 0.6 GB | 152 | 7.0 |