
EXAONE Deep 2.4B

reasoning · math · coding
Parameters
2.4B
Context length
32K
Benchmarks
6
Quantizations
6
Architecture
Dense
Released
2025-03-16
Layers
30
KV Heads
8
Head Dim
128
Family
exaone
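
With 30 layers, 8 KV heads, and a 128-wide head dimension, the per-token KV-cache footprint follows directly from these specs. A minimal sketch in Python, assuming FP16 (2-byte) cache entries; real runtimes may quantize the cache:

# Estimate KV-cache memory from the attention geometry above.
N_LAYERS = 30
N_KV_HEADS = 8
HEAD_DIM = 128
BYTES_PER_ELEM = 2  # FP16 cache entries (assumption)

# Each token stores one key and one value vector per layer per KV head.
bytes_per_token = 2 * N_LAYERS * N_KV_HEADS * HEAD_DIM * BYTES_PER_ELEM

for ctx in (2048, 8192, 32768):
    print(f"{ctx:>6} tokens -> {bytes_per_token * ctx / 1024**3:.2f} GiB KV cache")

Filling the full 32K window at FP16 adds roughly 3.7 GiB on top of the weights, so the VRAM figures below are best read as weights plus a modest working context.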

Quantization Options

Quant | Bits | VRAM | Quality
Q4_K_M | 4.89 | 2.0 GB | good
Q5_K_S | 5.57 | 2.2 GB | good
Q5_K_M | 5.7 | 2.2 GB | good
Q6_K | 6.56 | 2.5 GB | excellent
Q8_0 | 8.5 | 3.0 GB | lossless
FP16 | 16 | 5.3 GB | lossless
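
The VRAM column tracks parameter count times bits per weight plus runtime overhead. A rough sanity check in Python, assuming 2.4B parameters and a flat 0.5 GB overhead allowance (an assumption; actual overhead varies with runtime and context length):

# Weights = parameters * bits-per-weight / 8, plus an overhead allowance.
PARAMS = 2.4e9
OVERHEAD_GB = 0.5  # assumed allowance for buffers, activations, small KV cache

for quant, bpw in [("Q4_K_M", 4.89), ("Q6_K", 6.56), ("Q8_0", 8.5), ("FP16", 16)]:
    weights_gb = PARAMS * bpw / 8 / 1e9
    print(f"{quant:7s} ~{weights_gb:.1f} GB weights, ~{weights_gb + OVERHEAD_GB:.1f} GB total")

Running it reproduces the table within rounding: about 2.0 GB for Q4_K_M and 5.3 GB for FP16.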

Select your GPU above to see speed estimates and compatibility for each quantization.

Benchmarks (6)

IFEval: 79.5
MATH: 36.8
MMLU-PRO: 25.3
BBH: 15.9
MUSR: 3.2
GPQA: 2.1

Run this model

Easiest way to get started · docs →
curl -fsSL https://ollama.com/install.sh | sh
ollama run exaone-deep:2.4b

Downloads and runs automatically. Add --verbose for speed stats.
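
Once pulled, the model can also be driven programmatically. A minimal sketch against Ollama's local REST API, assuming the server is running on its default port 11434:

# Query the local Ollama server over its REST API (default port 11434).
import json
import urllib.request

payload = {
    "model": "exaone-deep:2.4b",
    "prompt": "Solve step by step: what is 17 * 24?",
    "stream": False,  # one JSON object instead of a token stream
}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])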

Setup guide

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

Find the best GPU for EXAONE Deep 2.4B

Build Hardware for EXAONE Deep 2.4B

EXAONE Deep 2.4B: a 2.4B-parameter dense LLM

Model Specifications

Parameters
2.4B
Architecture
Dense Transformer
Context Length
32K tokens
Capabilities
reasoning, math, coding
Release Date
2025-03-16
Family
exaone

VRAM Requirements

Quantization | BPW | VRAM | Quality
Q4_K_M | 4.89 | 2.0 GB | 94%
Q5_K_S | 5.57 | 2.2 GB | 96%
Q5_K_M | 5.7 | 2.2 GB | 96%
Q6_K | 6.56 | 2.5 GB | 97%
Q8_0 | 8.5 | 3.0 GB | 100%
FP16 | 16 | 5.3 GB | 100%
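
Which quantization to pick depends on the card. A small hypothetical helper, using the table above, that selects the highest-quality quant fitting a VRAM budget:

# Pick the highest-quality quantization that fits a VRAM budget.
# Values taken from the VRAM Requirements table above.
QUANTS = [
    ("Q4_K_M", 2.0, 94), ("Q5_K_S", 2.2, 96), ("Q5_K_M", 2.2, 96),
    ("Q6_K", 2.5, 97), ("Q8_0", 3.0, 100), ("FP16", 5.3, 100),
]

def best_quant(vram_budget_gb):
    """Best-fitting quant (prefer quality, then precision), or None."""
    fitting = [q for q in QUANTS if q[1] <= vram_budget_gb]
    return max(fitting, key=lambda q: (q[2], q[1])) if fitting else None

print(best_quant(4.0))  # -> ('Q8_0', 3.0, 100)
print(best_quant(8.0))  # -> ('FP16', 5.3, 100)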

Benchmark Scores

MMLU-PRO: 25.3
MATH: 36.8
IFEval: 79.5
BBH: 15.9
GPQA: 2.1
MUSR: 3.2

How to Run EXAONE Deep 2.4B

Run EXAONE Deep 2.4B locally with Ollama (needs 2.0 GB VRAM at Q4_K_M):

ollama run exaone-deep:2.4b
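
For programmatic use, the official ollama Python client (pip install ollama) can stream tokens as they arrive; a minimal sketch, assuming the local server is running:

# Stream a chat completion using the official ollama Python client.
import ollama

stream = ollama.chat(
    model="exaone-deep:2.4b",
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
    stream=True,  # yield chunks as they are generated
)
for chunk in stream:
    print(chunk["message"]["content"], end="", flush=True)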

Compatible GPUs (30)

GPUs that can run EXAONE Deep 2.4B at Q4_K_M quantization: