Baidu/MoE

ERNIE-4.5-300B-A47B-Paddle

Capabilities: chat
Parameters: 300.5B
Context length: 128K
Benchmarks: 1
Quantizations: 17
HF downloads: 1K

Architecture: Mixture of Experts (MoE), 47B active parameters per token
Released: 2025-06-28
Layers: 54
KV Heads: 8
Head Dim: 128
Family: ernie
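The layer count, KV head count, and head dimension listed here are enough for a rough estimate of the key/value cache that a long context adds on top of the model weights. The following is a minimal sketch, assuming standard grouped-query attention, an FP16 cache (2 bytes per element), and that "128K" means 131,072 tokens. None of these assumptions is stated on this page, and the page does not say whether its VRAM figures include any cache.

```python
def kv_cache_bytes(layers, kv_heads, head_dim, tokens, bytes_per_elem=2):
    """Rough KV-cache size: keys and values for every layer and token."""
    # The factor 2 covers the separate key and value tensors.
    return 2 * layers * kv_heads * head_dim * bytes_per_elem * tokens

# Layer and head values come from the spec list; 131072 tokens is an
# assumed reading of "128K", and 2 bytes per element assumes an FP16 cache.
full_context = kv_cache_bytes(layers=54, kv_heads=8, head_dim=128, tokens=131072)
print(f"{full_context / 1e9:.1f} GB")  # 29.0 GB
```

Under these assumptions a full-length context costs roughly 29 GB, which matters on devices where the weights alone already take most of the memory.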

Quantization Options

Quant	Bits	VRAM	Quality
IQ2_XXS	2.38	89.9 GB	low
IQ2_M	2.93	110.5 GB	low
Q2_K	3.16	119.2 GB	low
IQ3_XXS	3.25	122.6 GB	low
IQ3_XS	3.5	132.0 GB	low
Q3_K_S	3.64	137.2 GB	low
IQ3_M	3.76	141.7 GB	low
Q3_K_M	4	150.7 GB	low
Q3_K_L	4.3	162.0 GB	moderate
IQ4_XS	4.46	168.0 GB	moderate
Q4_K_S	4.67	175.9 GB	moderate
Q4_K_M	4.89	184.2 GB	good
Q5_K_S	5.57	209.7 GB	good
Q5_K_M	5.7	214.6 GB	good
Q6_K	6.56	246.9 GB	excellent
Q8_0	8.5	319.8 GB	lossless
FP16	16	601.5 GB	lossless


Benchmarks (1)

GPQA Diamond: 74.0

Run this model

Easiest way to get started

curl -fsSL https://ollama.com/install.sh | sh

ollama run ernie:300b-q4_k_m

Downloads and runs automatically. Add --verbose for speed stats.
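Once the model is running, Ollama also serves a local HTTP API, by default at http://localhost:11434. The sketch below shows one way to query it from Python with only the standard library. The model tag is copied from the command above and has not been verified against the Ollama library; the helper names `build_request` and `generate` are illustrative.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_request(prompt, model="ernie:300b-q4_k_m"):
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def generate(prompt, model="ernie:300b-q4_k_m"):
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server up, `generate("Explain KV caching in two sentences.")` would return the completion as a string. Setting `"stream": False` keeps the response as a single JSON object rather than a stream of chunks.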


GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

AMD Instinct MI300X

192 GB VRAM • 5300 GB/s

Apple M2 Ultra (192GB)

192 GB VRAM • 800 GB/s

Apple M3 Ultra (192GB)

192 GB VRAM • 800 GB/s

Apple M4 Ultra (192GB)

192 GB VRAM • 1092 GB/s

AMD Radeon Instinct MI300A

192 GB VRAM • 10300 GB/s

AMD Radeon Instinct MI300X

192 GB VRAM • 10300 GB/s

AMD Radeon Instinct MI308X

192 GB VRAM • 10300 GB/s

Apple M5 Ultra (192GB)

192 GB VRAM • 1228 GB/s

AMD Radeon Instinct MI325X

288 GB VRAM • 10300 GB/s

AMD Radeon Instinct MI350X

288 GB VRAM • 8190 GB/s

AMD Radeon Instinct MI355X

288 GB VRAM • 8190 GB/s

Apple M4 Ultra (384GB)

384 GB VRAM • 1092 GB/s

Apple M5 Ultra (384GB)

384 GB VRAM • 1228 GB/s
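A list like this one can be reproduced by comparing each device's memory with the requirement for a given quantization. The sketch below does that for a subset of the devices, using capacities from this list and requirements from the quantization table. It is a simplification: it assumes the listed VRAM figure is the only constraint and ignores KV cache and runtime overhead.

```python
# Memory capacities in GB, copied from the list above (a subset for brevity).
DEVICES = {
    "AMD Instinct MI300X": 192,
    "Apple M2 Ultra (192GB)": 192,
    "AMD Radeon Instinct MI325X": 288,
    "Apple M4 Ultra (384GB)": 384,
}

# VRAM requirements in GB, copied from the quantization table.
REQUIRED = {"Q4_K_M": 184.2, "Q6_K": 246.9, "Q8_0": 319.8}

def compatible(quant):
    """Return device names with enough memory, sorted by minimum VRAM."""
    need = REQUIRED[quant]
    fitting = (name for name, vram in DEVICES.items() if vram >= need)
    return sorted(fitting, key=DEVICES.get)

for quant in REQUIRED:
    print(quant, compatible(quant))
```

At Q4_K_M all four devices qualify; moving to Q6_K or Q8_0 leaves only the larger-memory parts.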


ERNIE-4.5-300B-A47B-Paddle: 300.5B-Parameter MoE LLM

Model Specifications

Parameters: 300.5B
Architecture: Mixture-of-Experts (MoE) Transformer, 47B active parameters per token
Context Length: 128K tokens
Capabilities: chat
Release Date: 2025-06-28
Provider: Baidu
Family: ernie

VRAM Requirements

Quantization	BPW	VRAM	Quality
IQ2_XXS	2.38	89.9 GB	65%
IQ2_M	2.93	110.5 GB	75%
Q2_K	3.16	119.2 GB	78%
IQ3_XXS	3.25	122.6 GB	82%
IQ3_XS	3.5	132.0 GB	84%
Q3_K_S	3.64	137.2 GB	85%
IQ3_M	3.76	141.7 GB	86%
Q3_K_M	4	150.7 GB	88%
Q3_K_L	4.3	162.0 GB	90%
IQ4_XS	4.46	168.0 GB	92%
Q4_K_S	4.67	175.9 GB	93%
Q4_K_M	4.89	184.2 GB	94%
Q5_K_S	5.57	209.7 GB	96%
Q5_K_M	5.7	214.6 GB	96%
Q6_K	6.56	246.9 GB	97%
Q8_0	8.5	319.8 GB	100%
FP16	16	601.5 GB	100%
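The VRAM column tracks a simple relationship: parameter count times bits per weight, divided by 8 to get bytes, plus a small constant. The sketch below reproduces the table to within about 0.1 GB per row. The 0.5 GB overhead is an assumption inferred by fitting the rows; the page does not state how its figures were derived.

```python
PARAMS_B = 300.5    # parameters in billions, from the spec list
OVERHEAD_GB = 0.5   # assumption: constant inferred by fitting the table rows

def estimate_vram_gb(bpw, params_b=PARAMS_B, overhead_gb=OVERHEAD_GB):
    """Weights at `bpw` bits each, converted to GB, plus a fixed overhead."""
    return params_b * bpw / 8 + overhead_gb

# Compare the estimate with a few rows of the table.
for name, bpw, listed in [("Q4_K_M", 4.89, 184.2), ("Q8_0", 8.5, 319.8), ("FP16", 16, 601.5)]:
    print(f"{name}: estimated {estimate_vram_gb(bpw):.1f} GB, listed {listed} GB")
```

The same function gives a quick estimate for a bits-per-weight value that is not in the table, under the same assumed overhead.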

Benchmark Scores

GPQA Diamond: 74.0

How to Run ERNIE-4.5-300B-A47B-Paddle

Run ERNIE-4.5-300B-A47B-Paddle locally with Ollama (needs 184.2 GB VRAM at Q4_K_M):

ollama run ernie:300b

Compatible GPUs (13)

GPUs that can run ERNIE-4.5-300B-A47B-Paddle at Q4_K_M quantization:

AMD Instinct MI300X (192GB, 5300 GB/s)
Apple M2 Ultra (192GB) (192GB, 800 GB/s)
Apple M3 Ultra (192GB) (192GB, 800 GB/s)
Apple M4 Ultra (192GB) (192GB, 1092 GB/s)
AMD Radeon Instinct MI300A (192GB, 10300 GB/s)
AMD Radeon Instinct MI300X (192GB, 10300 GB/s)
AMD Radeon Instinct MI308X (192GB, 10300 GB/s)
Apple M5 Ultra (192GB) (192GB, 1228 GB/s)
AMD Radeon Instinct MI325X (288GB, 10300 GB/s)
AMD Radeon Instinct MI350X (288GB, 8190 GB/s)
AMD Radeon Instinct MI355X (288GB, 8190 GB/s)
Apple M4 Ultra (384GB) (384GB, 1092 GB/s)
Apple M5 Ultra (384GB) (384GB, 1228 GB/s)