Gemma 4 E2B — 5.1B Parameter Dense LLM
Model Specifications
- Parameters
- 5.1B
- Architecture
- Dense Transformer
- Context Length
- 128K tokens
- Capabilities
- chat, multilingual, vision, audio
- Release Date
- 2026-04-02
- Provider
- Family
- gemma
VRAM Requirements
| Quantization | BPW | VRAM | Quality |
|---|---|---|---|
| Q4_K_M | 4.89 | 3.6 GB | 94% |
| Q5_K_S | 5.57 | 4.0 GB | 96% |
| Q5_K_M | 5.7 | 4.1 GB | 96% |
| Q6_K | 6.56 | 4.7 GB | 97% |
| Q8_0 | 8.5 | 5.9 GB | 100% |
| FP16 | 16 | 10.7 GB | 100% |
Benchmark Scores
MMLU-PRO60.0
BBH21.9
GPQA Diamond43.4
LiveCodeBench44.0
AIME37.5
How to Run Gemma 4 E2B
Run Gemma 4 E2B locally with Ollama (needs 3.6 GB VRAM at Q4_K_M):
ollama run gemma4:e2bCompatible GPUs (30)
GPUs that can run Gemma 4 E2B at Q4_K_M quantization:
NVIDIA Tesla C1080(4GB, 102 GB/s)NVIDIA Tesla K10(4GB, 160 GB/s)NVIDIA Tesla M4(4GB, 88 GB/s)NVIDIA GeForce GTX 1050 Ti(4GB, 112 GB/s)AMD Radeon Instinct MI8(4GB, 512 GB/s)NVIDIA GeForce GTX 1650(4GB, 128 GB/s)NVIDIA GeForce GTX 1650 SUPER(4GB, 192 GB/s)AMD Radeon RX 5300 XT OEM(4GB, 112 GB/s)AMD Radeon RX 5500 OEM(4GB, 224 GB/s)AMD Radeon RX 5500 XT(4GB, 224 GB/s)AMD Radeon RX 5500M(4GB, 224 GB/s)NVIDIA GeForce GTX 1650 GDDR6(4GB, 192 GB/s)NVIDIA GeForce GTX 1650 TU106(4GB, 192 GB/s)NVIDIA GeForce GTX 1650 TU116(4GB, 192 GB/s)NVIDIA GeForce GTX 1630(4GB, 96 GB/s)