▸ SPEC SHEET
Qwen 3.5 4B — 4B Dense.
▸ SPECIFICATIONS
- PARAMETERS: 4B
- ARCHITECTURE: Dense Transformer
- CONTEXT LENGTH: 256K tokens
- CAPABILITIES: chat, coding, reasoning, multilingual, vision, math
- RELEASE DATE: 2026-03-01
- PROVIDER: Alibaba
- FAMILY: qwen
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q4_K_M | 4.89 | 2.9 GB | 94% |
| Q5_K_S | 5.57 | 3.3 GB | 96% |
| Q5_K_M | 5.7 | 3.3 GB | 96% |
| Q6_K | 6.56 | 3.8 GB | 97% |
| Q8_0 | 8.5 | 4.7 GB | 100% |
| FP16 | 16 | 8.5 GB | 100% |
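
The VRAM column can be sanity-checked from the BPW column: weights-only size is parameters × bits per weight ÷ 8. The sketch below is an approximation, not the sheet's own method. It assumes exactly 4e9 parameters and 1 GB = 1e9 bytes, neither of which the sheet states, and `weight_gb` and `remainders` are hypothetical helper names. The remainder it reports (presumably runtime buffers and context cache) is inferred from the table, not documented.

```python
# Weights-only size estimate: parameters * bits-per-weight / 8 bytes.
# ASSUMPTIONS (not stated by the sheet): exactly 4e9 parameters, 1 GB = 1e9 bytes.
PARAMS_B = 4.0  # nominal parameter count, in billions

# (quant, bits per weight, listed VRAM in GB), copied from the table above
TABLE = [
    ("Q4_K_M", 4.89, 2.9),
    ("Q5_K_S", 5.57, 3.3),
    ("Q5_K_M", 5.7, 3.3),
    ("Q6_K", 6.56, 3.8),
    ("Q8_0", 8.5, 4.7),
    ("FP16", 16, 8.5),
]


def weight_gb(params_billion: float, bpw: float) -> float:
    """Return the weights-only size in GB for a given bits-per-weight."""
    return params_billion * bpw / 8


def remainders() -> dict:
    """Listed VRAM minus the weights-only estimate, per quantization."""
    return {q: listed - weight_gb(PARAMS_B, bpw) for q, bpw, listed in TABLE}


if __name__ == "__main__":
    for quant, bpw, listed in TABLE:
        w = weight_gb(PARAMS_B, bpw)
        print(f"{quant:7s} weights ~{w:.2f} GB, listed {listed} GB, "
              f"remainder ~{listed - w:.2f} GB")
```

Under these assumptions every row leaves a remainder of roughly half a gigabyte, which suggests the listed figures include a small fixed allowance on top of the weights.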
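§ 01 BENCHMARK SCORES
| BENCHMARK | SCORE |
|---|---|
| MMLU-PRO | 79.1 |
| MATH | 49.6 |
| IFEval | 89.8 |
| BBH | 34.9 |
| MMMU | 77.6 |
| GPQA | 6.4 |
| MUSR | 8.7 |
| MMBench | 89.4 |
| GPQA Diamond | 76.2 |
| HLE | 7.8 |
| AA Intelligence | 27.1 |
| AA Coding | 17.5 |
| LiveCodeBench | 55.8 |
| aa_ifbench | 52.0 |
| aa_terminal_bench | 18.2 |
| aa_tau2 | 92.1 |
| aa_scicode | 16.1 |
| aa_lcr | 55.7 |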
§ 02 RUN COMMAND
Run Qwen 3.5 4B locally with Ollama (needs 2.9 GB of VRAM at Q4_K_M):
$ ollama run qwen3.5:4b
§ 03 COMPATIBLE GPUs
30 @ Q4_K_M
NVIDIA Tesla C2050
3 GB · 144 GB/s
NVIDIA Tesla M2050
3 GB · 148 GB/s
NVIDIA Tesla S2050
3 GB · 148 GB/s
NVIDIA GeForce GTX 670MX
3 GB · 67 GB/s
AMD Radeon HD 7950
3 GB · 240 GB/s
AMD Radeon HD 7950 Boost
3 GB · 240 GB/s
AMD Radeon HD 7950 Monica BIOS 1
3 GB · 240 GB/s
AMD Radeon HD 7950 Monica BIOS 2
3 GB · 240 GB/s
AMD Radeon HD 7970
3 GB · 264 GB/s
AMD Radeon HD 7970 GHz Edition
3 GB · 288 GB/s
AMD Radeon HD 7970 X2
3 GB · 264 GB/s
NVIDIA GeForce GTX 770M
3 GB · 96 GB/s
NVIDIA GeForce GTX 780
3 GB · 288 GB/s
NVIDIA GeForce GTX 780 Rev. 2
3 GB · 288 GB/s
NVIDIA GeForce GTX 780 Ti
3 GB · 337 GB/s
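
The two figures listed per GPU (memory size and memory bandwidth) support a quick check: does the Q4_K_M build fit, and what is a rough ceiling on generation speed? The sketch below uses the common rule of thumb that decoding is memory-bandwidth bound, so tokens per second cannot exceed bandwidth divided by the bytes read per token. It is an assumption-laden upper bound, not a benchmark: it treats the 2.9 GB VRAM figure as the amount read per token, ignores compute limits, and does not check whether a given runtime supports these older cards. `fits` and `decode_ceiling_tok_s` are hypothetical helper names.

```python
# Fit check and bandwidth-bound decode ceiling for a few GPUs from the list.
# ASSUMPTION: each generated token reads the whole model once, so
# tokens/s <= memory bandwidth / model size. Real throughput will be lower.
MODEL_GB = 2.9  # Q4_K_M VRAM requirement from the sheet

# (name, VRAM in GB, memory bandwidth in GB/s), copied from the list above
GPUS = [
    ("NVIDIA GeForce GTX 670MX", 3, 67),
    ("AMD Radeon HD 7970 GHz Edition", 3, 288),
    ("NVIDIA GeForce GTX 780 Ti", 3, 337),
]


def fits(vram_gb: float, required_gb: float = MODEL_GB) -> bool:
    """True if the card has at least the required VRAM."""
    return vram_gb >= required_gb


def decode_ceiling_tok_s(bandwidth_gb_s: float, model_gb: float = MODEL_GB) -> float:
    """Upper bound on tokens per second under the bandwidth-bound assumption."""
    return bandwidth_gb_s / model_gb


if __name__ == "__main__":
    for name, vram, bw in GPUS:
        print(f"{name}: fits={fits(vram)}, "
              f"headroom={vram - MODEL_GB:.1f} GB, "
              f"ceiling ~{decode_ceiling_tok_s(bw):.0f} tok/s")
```

Every 3 GB card listed clears the 2.9 GB requirement by only about 0.1 GB, so long contexts may not fit even where the model itself does.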