▸ SPEC SHEET
Qwen3.5-0.8B — 0.9B Dense.
▸ SPECIFICATIONS
- PARAMETERS
- 0.9B
- ARCHITECTURE
- Dense Transformer
- CONTEXT LENGTH
- 256K tokens
- CAPABILITIES
- chat
- RELEASE DATE
- 2025-06-01
- PROVIDER
- Alibaba
- FAMILY
- qwen
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q4_K_M | 4.89 | 1.0 GB | 94% |
| Q5_K_S | 5.57 | 1.1 GB | 96% |
| Q5_K_M | 5.7 | 1.1 GB | 96% |
| Q6_K | 6.56 | 1.2 GB | 97% |
| Q8_0 | 8.5 | 1.4 GB | 100% |
| FP16 | 16 | 2.3 GB | 100% |
§ 01BENCHMARK SCORES
MMLU-PRO2.4
MATH60.0
IFEval17.5
BBH3.4
MUSR90.0
BigCodeBench8.8
GPQA Diamond11.1
HLE1.2
AA Intelligence10.5
AA Coding0.0
GPQA6.7
aa_ifbench21.6
aa_tau265.2
aa_scicode2.9
aa_lcr6.7
§ 02RUN COMMAND
Run Qwen3.5-0.8B locally with Ollama — needs 1.0 GB VRAM at Q4_K_M:
$
ollama run qwen3:900m§ 03COMPATIBLE GPUs
30 @ Q4_K_MNVIDIA GeForce GTX 470
1 GB · 133.9 GB/s
NVIDIA GeForce GTX 570
1 GB · 152 GB/s
NVIDIA GeForce GTX 570 Rev. 2
1 GB · 152 GB/s
NVIDIA GeForce GTX 460 v2 ES
1 GB · 128.3 GB/s
NVIDIA GeForce GTX 560 OEM
1 GB · 128.3 GB/s
NVIDIA GeForce GTX 560 Ti 448
1 GB · 152 GB/s
NVIDIA Quadro FX 5600
2 GB · 76.8 GB/s
NVIDIA Quadro FX 5600 Mac Edition
2 GB · 76.8 GB/s
NVIDIA Tesla C870
2 GB · 76.8 GB/s
NVIDIA Tesla D870
2 GB · 76.8 GB/s
NVIDIA Tesla S870
2 GB · 76.8 GB/s
NVIDIA Quadro CX
2 GB · 76.8 GB/s
NVIDIA Quadro FX 4800
2 GB · 76.8 GB/s
NVIDIA GeForce GT 230 OEM
2 GB · 24 GB/s
NVIDIA GeForce GT 440 OEM
2 GB · 43.2 GB/s