▸ DEVICE UNDER TEST
NVIDIA GRID M60-1Q — 1 GB VRAM.
▸ GRID M60-1Q SPEC
- BRAND
- NVIDIA
- VRAM
- 1 GB GDDR5
- BANDWIDTH
- 160.4 GB/s
- FP16 COMPUTE
- 4.8 TFLOPS
- FP32 COMPUTE
- 4.8 TFLOPS
- CUDA CORES
- 2,048
- TDP
- 225 W
- ARCHITECTURE
- Maxwell 2.0
▸ AI CAPABILITY
13/ 331 models @ Q4
With 1 GB VRAM and 160.4 GB/s bandwidth, this GPU handles models up to 0.6B parameters.
Speed ≈ bandwidth / model_size × efficiency. A 7B model at Q4 runs at ~18 tok/s.
§ 01TOP MODELS FOR GRID M60-1Q
13 FIT · SHOWING 13| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Qwen3 0.6B | 0.6B | 0.9 GB | 214 | 19.1 |
| BGE-M3 | 0.568B | 0.8 GB | 226 | 63.0 |
| Falcon-H1 0.5B | 0.5B | 0.8 GB | 257 | 41.7 |
| Qwen 1.5 0.5B | 0.5B | 0.8 GB | 257 | 9.9 |
| Qwen 2.5 0.5B | 0.5B | 0.8 GB | 257 | 19.4 |
| SmolLM2 360M | 0.36B | 0.7 GB | 356 | 8.2 |
| GPT-2 Medium 345M | 0.345B | 0.7 GB | 372 | 5.9 |
| bge-large-en-v1.5 335M | 0.335B | 0.7 GB | 383 | 62.3 |
| mxbai-embed-large-v1 | 0.335B | 0.7 GB | 383 | 64.7 |
| Snowflake Arctic Embed L | 0.335B | 0.7 GB | 383 | 56.0 |
| SmolLM2 135M | 0.135B | 0.6 GB | 951 | 7.0 |
| GPT-2 124M | 0.124B | 0.6 GB | 1035 | 6.5 |
| nomic-embed-text-v1.5 100M | 0.1B | 0.5 GB | 1283 | 62.3 |