▸ SPEC SHEET
Qwen3.5-4B — 4.7B Dense.
▸ SPECIFICATIONS
- PARAMETERS
- 4.7B
- ARCHITECTURE
- Dense Transformer
- CONTEXT LENGTH
- 256K tokens
- CAPABILITIES
- chat
- RELEASE DATE
- 2025-06-01
- PROVIDER
- Alibaba
- FAMILY
- qwen
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q4_K_M | 4.89 | 3.4 GB | 94% |
| Q5_K_S | 5.57 | 3.8 GB | 96% |
| Q5_K_M | 5.7 | 3.8 GB | 96% |
| Q6_K | 6.56 | 4.3 GB | 97% |
| Q8_0 | 8.5 | 5.5 GB | 100% |
| FP16 | 16 | 9.9 GB | 100% |
§ 01BENCHMARK SCORES
MMLU-PRO15.5
MATH2.8
IFEval31.6
BBH16.3
GPQA2.2
MUSR7.4
GPQA Diamond77.1
HLE7.8
AA Intelligence27.1
AA Coding17.5
aa_ifbench52.0
aa_terminal_bench18.2
aa_tau292.1
aa_scicode16.1
aa_lcr55.7
§ 02RUN COMMAND
Run Qwen3.5-4B locally with Ollama — needs 3.4 GB VRAM at Q4_K_M:
$
ollama run qwen3:4.7b§ 03COMPATIBLE GPUs
30 @ Q4_K_MNVIDIA Tesla C1080
4 GB · 102 GB/s
NVIDIA Tesla K10
4 GB · 160 GB/s
NVIDIA Tesla M4
4 GB · 88 GB/s
NVIDIA GeForce GTX 1050 Ti
4 GB · 112 GB/s
AMD Radeon Instinct MI8
4 GB · 512 GB/s
NVIDIA GeForce GTX 1650
4 GB · 128 GB/s
NVIDIA GeForce GTX 1650 SUPER
4 GB · 192 GB/s
AMD Radeon RX 5300 XT OEM
4 GB · 112 GB/s
AMD Radeon RX 5500 OEM
4 GB · 224 GB/s
AMD Radeon RX 5500 XT
4 GB · 224 GB/s
AMD Radeon RX 5500M
4 GB · 224 GB/s
NVIDIA GeForce GTX 1650 GDDR6
4 GB · 192 GB/s
NVIDIA GeForce GTX 1650 TU106
4 GB · 192 GB/s
NVIDIA GeForce GTX 1650 TU116
4 GB · 192 GB/s
NVIDIA GeForce GTX 1630
4 GB · 96 GB/s