Alibaba/Mixture of Experts
Qwen 3.5 35B A3B
chatcodingreasoningmultilingualvisionmath
35B
Parameters (3B active)
256K
Context length
14
Benchmarks
4
Quantizations
300K
HF downloads
Architecture
MoE
Released
2026-02-01
Layers
64
KV Heads
4
Head Dim
128
Family
qwen
Quantizations & VRAM
Q4_K_M4.5 bpw
20.2 GB
VRAM required
94%
Quality
Q6_K6.5 bpw
28.9 GB
VRAM required
97%
Quality
Q8_08 bpw
35.5 GB
VRAM required
100%
Quality
FP1616 bpw
70.5 GB
VRAM required
100%
Quality
Benchmarks (14)
Arena Elo1485
IFEval91.9
MMBench91.5
MMLU-PRO85.3
GPQA Diamond81.9
MMMU75.1
MATH59.7
BBH58.3
BigCodeBench32.3
AA Intelligence30.7
MUSR19.1
AA Coding16.8
GPQA15.2
HLE12.8
Run with Ollama
$
ollama run qwen3.5:35b-a3bGPUs that can run this model
At Q4_K_M quantization. Sorted by minimum VRAM.
NVIDIA RTX 4090
24 GB VRAM • 1008 GB/s
NVIDIA
$1599
NVIDIA RTX 3090 Ti
24 GB VRAM • 1008 GB/s
NVIDIA
$999
NVIDIA RTX 3090
24 GB VRAM • 936 GB/s
NVIDIA
$850
AMD RX 7900 XTX
24 GB VRAM • 960 GB/s
AMD
$999
Apple M4 Pro (24GB)
24 GB VRAM • 273 GB/s
APPLE
$1399
NVIDIA L4 24GB
24 GB VRAM • 300 GB/s
NVIDIA
$2500
NVIDIA A10 24GB
24 GB VRAM • 600 GB/s
NVIDIA
$3500
Apple M2 (24GB)
24 GB VRAM • 100 GB/s
APPLE
$999
Apple M3 (24GB)
24 GB VRAM • 100 GB/s
APPLE
$999
Apple M4 (24GB)
24 GB VRAM • 120 GB/s
APPLE
$699
NVIDIA Tesla M40 24 GB
24 GB VRAM • 288 GB/s
NVIDIA
NVIDIA Tesla P10
24 GB VRAM • 694 GB/s
NVIDIA
NVIDIA Tesla P40
24 GB VRAM • 347 GB/s
NVIDIA
NVIDIA Quadro RTX 6000
24 GB VRAM • 672 GB/s
NVIDIA
NVIDIA Quadro RTX 6000 Passive
24 GB VRAM • 624 GB/s
NVIDIA
NVIDIA GeForce RTX 3090
24 GB VRAM • 936 GB/s
NVIDIA
$1499
NVIDIA A10 PCIe
24 GB VRAM • 600 GB/s
NVIDIA
NVIDIA A10G
24 GB VRAM • 600 GB/s
NVIDIA
NVIDIA RTX A5000
24 GB VRAM • 768 GB/s
NVIDIA
NVIDIA GeForce RTX 3090 Ti
24 GB VRAM • 1010 GB/s
NVIDIA
$1999
NVIDIA GeForce RTX 4090
24 GB VRAM • 1010 GB/s
NVIDIA
$1599
NVIDIA L40 CNX
24 GB VRAM • 864 GB/s
NVIDIA
NVIDIA L40G
24 GB VRAM • 864 GB/s
NVIDIA
AMD Radeon RX 7900 XTX
24 GB VRAM • 960 GB/s
AMD
$999
NVIDIA GeForce RTX 4090 D
24 GB VRAM • 1010 GB/s
NVIDIA
$1599
NVIDIA GeForce RTX 5090 D V2
24 GB VRAM • 1340 GB/s
NVIDIA
$1999
NVIDIA TITAN RTX
24 GB VRAM • 672 GB/s
NVIDIA
NVIDIA A30 PCIe
24 GB VRAM • 933 GB/s
NVIDIA
NVIDIA A30X
24 GB VRAM • 1220 GB/s
NVIDIA
NVIDIA PG506-207
24 GB VRAM • 933 GB/s
NVIDIA
Find the best GPU for Qwen 3.5 35B A3B
Build Hardware for Qwen 3.5 35B A3B