
MiniMax-M2.5 228.7B

Capabilities: chat
Parameters: 228.7B (21B active)
Context length: 192K
Benchmarks: 7
Quantizations: 17
HF downloads: 493K
Architecture: MoE
Released: 2026-03-10
Layers: 62
KV Heads: 8
Head Dim: 128
Family: minimax
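The attention geometry above (62 layers, 8 KV heads, head dim 128) also sets the KV-cache footprint at long context. The sketch below is a back-of-the-envelope estimate, assuming an FP16 (2-byte) cache with no cache quantization; real runtimes add their own overhead.

# Rough KV-cache estimate from the spec values above (assumptions noted in the lead-in).
layers = 62
kv_heads = 8
head_dim = 128
context_tokens = 192 * 1024   # 192K context window
bytes_per_elem = 2            # FP16 cache

# K and V each store kv_heads * head_dim values per layer per token.
kv_bytes = 2 * layers * kv_heads * head_dim * context_tokens * bytes_per_elem
print(f"KV cache at full 192K context: ~{kv_bytes / 1e9:.0f} GB")  # ~50 GB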

Quantization Options

Quant      Bits   VRAM       Quality
IQ2_XXS    2.38   68.5 GB    low
IQ2_M      2.93   84.3 GB    low
Q2_K       3.16   90.8 GB    low
IQ3_XXS    3.25   93.4 GB    low
IQ3_XS     3.5    100.5 GB   low
Q3_K_S     3.64   104.5 GB   low
IQ3_M      3.76   108.0 GB   low
Q3_K_M     4      114.8 GB   low
Q3_K_L     4.3    123.4 GB   moderate
IQ4_XS     4.46   128.0 GB   moderate
Q4_K_S     4.67   134.0 GB   moderate
Q4_K_M     4.89   140.3 GB   good
Q5_K_S     5.57   159.7 GB   good
Q5_K_M     5.7    163.4 GB   good
Q6_K       6.56   188.0 GB   excellent
Q8_0       8.5    243.5 GB   lossless
FP16       16     457.9 GB   lossless
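The VRAM figures above track a simple bits-per-weight calculation. A minimal sketch, assuming the listed sizes are roughly total parameters times bits per weight plus a small fixed overhead (it ignores KV cache and runtime buffers):

params = 228.7e9   # total parameters; all experts are stored even though only 21B are active
bpw = 4.89         # Q4_K_M bits per weight from the table
weight_gb = params * bpw / 8 / 1e9
print(f"Q4_K_M weights: ~{weight_gb:.1f} GB")  # ~139.8 GB, close to the 140.3 GB listed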


Benchmarks (7)

Arena Elo: 1495
MATH: 86.3
GPQA: 85.2
GPQA Diamond: 84.8
AA Intelligence: 41.9
AA Coding: 37.4
HLE: 19.1

Run this model

Easiest way to get started (docs →):
curl -fsSL https://ollama.com/install.sh | sh
ollama run minimax:228b-q4_k_m

Downloads and runs automatically. Add --verbose for speed stats.
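Once the model is pulled, Ollama also serves a local REST API on port 11434, so you can script against it instead of using the interactive prompt. A minimal sketch using Python's requests library; the model tag mirrors the command above, so adjust it to whatever tag you actually pulled:

import requests

# Single non-streaming completion against the locally running Ollama server.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "minimax:228b-q4_k_m",  # tag from the ollama run command above
        "prompt": "Summarize mixture-of-experts inference in two sentences.",
        "stream": False,
    },
    timeout=600,
)
print(resp.json()["response"])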


GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.


MiniMax-M2.5 228.7B: a 228.7B-parameter Mixture of Experts LLM

Model Specifications

Parameters: 228.7B (21B active)
Architecture: Mixture of Experts
Context Length: 192K tokens
Capabilities: chat
Release Date: 2026-03-10
Provider: MiniMax
Family: minimax

VRAM Requirements

Quantization   BPW    VRAM       Quality
IQ2_XXS        2.38   68.5 GB    65%
IQ2_M          2.93   84.3 GB    75%
Q2_K           3.16   90.8 GB    78%
IQ3_XXS        3.25   93.4 GB    82%
IQ3_XS         3.5    100.5 GB   84%
Q3_K_S         3.64   104.5 GB   85%
IQ3_M          3.76   108.0 GB   86%
Q3_K_M         4      114.8 GB   88%
Q3_K_L         4.3    123.4 GB   90%
IQ4_XS         4.46   128.0 GB   92%
Q4_K_S         4.67   134.0 GB   93%
Q4_K_M         4.89   140.3 GB   94%
Q5_K_S         5.57   159.7 GB   96%
Q5_K_M         5.7    163.4 GB   96%
Q6_K           6.56   188.0 GB   97%
Q8_0           8.5    243.5 GB   100%
FP16           16     457.9 GB   100%
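One practical use of this table is picking the highest-quality quantization that fits a given VRAM budget. The quant_table list and pick_quant helper below are illustrative only; the figures are the weight sizes from the table, so leave headroom for KV cache and runtime buffers:

# (quant name, VRAM in GB) from the table above, best quality first.
quant_table = [
    ("FP16", 457.9), ("Q8_0", 243.5), ("Q6_K", 188.0),
    ("Q5_K_M", 163.4), ("Q5_K_S", 159.7), ("Q4_K_M", 140.3),
    ("Q4_K_S", 134.0), ("IQ4_XS", 128.0), ("Q3_K_L", 123.4),
    ("Q3_K_M", 114.8), ("IQ3_M", 108.0), ("Q3_K_S", 104.5),
    ("IQ3_XS", 100.5), ("IQ3_XXS", 93.4), ("Q2_K", 90.8),
    ("IQ2_M", 84.3), ("IQ2_XXS", 68.5),
]

def pick_quant(vram_budget_gb):
    """Return the best quant whose weights fit within the budget, else None."""
    for name, vram_gb in quant_table:
        if vram_gb <= vram_budget_gb:
            return name
    return None

print(pick_quant(160))  # e.g. two 80 GB GPUs -> Q5_K_S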

Benchmark Scores

MATH: 86.3
GPQA: 85.2
Arena Elo: 1495.0
GPQA Diamond: 84.8
HLE: 19.1
AA Intelligence: 41.9
AA Coding: 37.4

How to Run MiniMax-M2.5 228.7B

Run MiniMax-M2.5 228.7B locally with Ollama (needs 140.3 GB VRAM at Q4_K_M):

ollama run minimax:228b

Compatible GPUs (16)

GPUs that can run MiniMax-M2.5 228.7B at Q4_K_M quantization: