▸ DEVICE UNDER TEST
NVIDIA Tesla M2070-Q — 6 GB VRAM.
▸ TESLA M2070-Q SPEC
- BRAND: NVIDIA
- VRAM: 6 GB GDDR5
- BANDWIDTH: 150 GB/s
- FP16 COMPUTE: 1 TFLOPS
- FP32 COMPUTE: 1 TFLOPS
- CUDA CORES: 448
- TDP: 225 W
- ARCHITECTURE: Fermi
▸ AI CAPABILITY
142 / 331 models fit @ Q4
With 6 GB VRAM and 150 GB/s bandwidth, this GPU fits models up to about 8B parameters at Q4.
Estimated speed ≈ bandwidth / model_size × efficiency, where model_size is the quantized model's memory footprint. A 7B model at Q4 runs at ~17 tok/s.
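
The speed estimate can be sketched in a few lines of Python. This is only an illustration of the formula as stated here: the function name `estimate_tokens_per_second` is hypothetical, and the efficiency factor of 0.54 is an assumption back-computed from the table rows (150 / 5.4 × 0.54 ≈ 15), not a value given on this page.

```python
def estimate_tokens_per_second(
    bandwidth_gb_s: float,
    model_size_gb: float,
    efficiency: float = 0.54,  # assumed; inferred from the table, not published
) -> float:
    """Speed ≈ bandwidth / model_size × efficiency."""
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb * efficiency


BANDWIDTH_GB_S = 150.0  # Tesla M2070-Q memory bandwidth from the spec list

# Q4 footprints taken from the VRAM Q4 column of the table
for name, size_gb in [("Llama-3.1-8B", 5.4), ("Qwen2.5-7B", 5.1)]:
    speed = estimate_tokens_per_second(BANDWIDTH_GB_S, size_gb)
    print(f"{name}: ~{speed:.0f} tok/s")
```

With the assumed efficiency, the two sample rows round to the same TOK/S values the table lists for them (15 and 16). A different efficiency factor would shift every estimate proportionally.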
§ 01 TOP MODELS FOR TESLA M2070-Q
142 FIT · SHOWING 20

| MODEL | SIZE | VRAM Q4 | TOK/S | AVG |
|---|---|---|---|---|
| Aya Expanse 8B | 8B | 5.4 GB | 15 | 27.8 |
| Cogito 8B | 8B | 5.4 GB | 15 | 17.2 |
| DeepSeek R1 Distill Llama 8B | 8B | 5.4 GB | 15 | 36.2 |
| Gemma 3n E4B | 8B | 5.4 GB | 15 | 28.8 |
| Granite 3.3 8B | 8B | 5.4 GB | 15 | 24.6 |
| Llama-3.1-8B | 8B | 5.4 GB | 15 | 23.5 |
| Dolphin Llama 3 8B | 8B | 5.4 GB | 15 | 23.8 |
| Llama 3 8B | 8B | 5.4 GB | 15 | 37.3 |
| Tulu 3 8B | 8B | 5.4 GB | 15 | 31.5 |
| Ministral-8B | 8B | 5.4 GB | 15 | 19.3 |
| Nemotron-H 8B | 8B | 5.4 GB | 15 | 78.4 |
| Granite 8B | 8B | 5.4 GB | 15 | 26.1 |
| InternVL2 8B | 8B | 5.4 GB | 15 | 44.6 |
| MiniCPM-V 2.6 8B | 8B | 5.4 GB | 15 | 40.8 |
| RNJ-1 8B | 8B | 5.4 GB | 15 | 53.5 |
| Gemma 4 E4B | 8B | 5.4 GB | 15 | 32.1 |
| EXAONE Deep 7.8B | 7.8B | 5.3 GB | 15 | 41.4 |
| InternLM2.5 7B | 7.74B | 5.2 GB | 16 | 44.8 |
| Qwen2.5-7B | 7.6B | 5.1 GB | 16 | 35.2 |
| Qwen2.5-Coder-7B | 7.6B | 5.1 GB | 16 | 31.0 |