Qwen 3.5 0.8B — 0.8B Parameter Dense LLM
Model Specifications
- Parameters
- 0.8B
- Architecture
- Dense Transformer
- Context Length
- 256K tokens
- Capabilities
- chat, coding, multilingual, vision
- Release Date
- 2026-03-01
- Provider
- Alibaba
- Family
- qwen
VRAM Requirements
| Quantization | BPW | VRAM | Quality |
|---|---|---|---|
| Q4_K_M | 4.89 | 1.0 GB | 94% |
| Q5_K_S | 5.57 | 1.0 GB | 96% |
| Q5_K_M | 5.7 | 1.1 GB | 96% |
| Q6_K | 6.56 | 1.1 GB | 97% |
| Q8_0 | 8.5 | 1.3 GB | 100% |
| FP16 | 16 | 2.1 GB | 100% |
Benchmark Scores
MMLU-PRO29.7
IFEval52.1
MMMU49.0
MMBench69.9
How to Run Qwen 3.5 0.8B
Run Qwen 3.5 0.8B locally with Ollama (needs 1.0 GB VRAM at Q4_K_M):
ollama run qwen3.5:0.8bCompatible GPUs (30)
GPUs that can run Qwen 3.5 0.8B at Q4_K_M quantization:
AMD FireGL V7350(1GB, 41.6 GB/s)NVIDIA Quadro FX 5500(1GB, 32.3 GB/s)NVIDIA Quadro FX 5500 SDI(1GB, 32.3 GB/s)AMD Stream Processor(1GB, 41.5 GB/s)AMD FireGL V8600(1GB, 111.1 GB/s)NVIDIA GeForce 9600 GT Mac Edition(1GB, 17 GB/s)NVIDIA GeForce 9600M GS(1GB, 25.6 GB/s)NVIDIA GeForce 9650M GT(1GB, 25.6 GB/s)NVIDIA GeForce 9800M GTS(1GB, 51.2 GB/s)NVIDIA GeForce 9800M GTX(1GB, 51.2 GB/s)NVIDIA GeForce GTX 280(1GB, 141.7 GB/s)NVIDIA GeForce GTX 285(1GB, 159 GB/s)NVIDIA Quadro FX 3700M(1GB, 51.2 GB/s)NVIDIA Quadro FX 3800M(1GB, 64 GB/s)NVIDIA Quadro FX 4700 X2(1GB, 51.2 GB/s)