▸ SPEC SHEET
Qwen3-8B — 8.2B Dense.
▸ SPECIFICATIONS
- PARAMETERS
- 8.2B
- ARCHITECTURE
- Dense Transformer
- CONTEXT LENGTH
- 40K tokens
- CAPABILITIES
- chat
- RELEASE DATE
- 2025-04-28
- PROVIDER
- Alibaba
- FAMILY
- qwen
▸ VRAM REQUIREMENTS
| QUANT | BPW | VRAM | QUALITY |
|---|---|---|---|
| Q4_K_M | 4.89 | 5.5 GB | 94% |
| Q5_K_S | 5.57 | 6.2 GB | 96% |
| Q5_K_M | 5.7 | 6.3 GB | 96% |
| Q6_K | 6.56 | 7.2 GB | 97% |
| Q8_0 | 8.5 | 9.2 GB | 100% |
| FP16 | 16 | 16.9 GB | 100% |
§ 01BENCHMARK SCORES
HumanEval85.0
MMLU-PRO55.0
MATH89.0
IFEval78.0
BBH36.6
GPQA45.0
MUSR15.5
Arena Elo1462.0
GPQA Diamond61.2
LiveCodeBench51.3
AIME63.7
MATH-50093.2
HLE5.6
AA Intelligence16.4
AA Coding7.8
AA Math63.7
aa_ifbench19.9
aa_terminal_bench1.5
aa_scicode20.4
aa_lcr13.0
§ 02RUN COMMAND
Run Qwen3-8B locally with Ollama — needs 5.5 GB VRAM at Q4_K_M:
$
ollama run qwen3:8.2b§ 03COMPATIBLE GPUs
30 @ Q4_K_MNVIDIA RTX 3050 6GB
6 GB · 168 GB/s
Intel Arc A380
6 GB · 186 GB/s
NVIDIA RTX 2060 6GB
6 GB · 336 GB/s
NVIDIA GTX 1660 SUPER
6 GB · 336 GB/s
NVIDIA GTX 1660 Ti
6 GB · 288 GB/s
NVIDIA GTX 1060 6GB
6 GB · 192 GB/s
NVIDIA Tesla C2070
6 GB · 143 GB/s
NVIDIA Tesla C2075
6 GB · 150 GB/s
NVIDIA Tesla C2090
6 GB · 177 GB/s
NVIDIA Tesla M2070
6 GB · 150 GB/s
NVIDIA Tesla M2070-Q
6 GB · 150 GB/s
NVIDIA Tesla M2075
6 GB · 150 GB/s
NVIDIA Tesla M2090
6 GB · 177 GB/s
NVIDIA Tesla X2070
6 GB · 177 GB/s
NVIDIA Tesla X2090
6 GB · 177 GB/s