Mistral AI/Dense

Mistral-Large 123B

Name: Mistral-Large 123B
Author: Mistral AI

Mistral-Large-Instruct-2407 is an advanced dense Large Language Model (LLM) of 123B parameters with state-of-the-art reasoning, knowledge and coding capabilities.

chattool_useThinkingTool Use

123B

Parameters

128K

Context length

Benchmarks

Quantizations

Architecture

Dense

Released

2024-07-24

Layers

KV Heads

Head Dim

128

Family

mistral

Quantization Options

Quant	Bits	VRAM	Quality
IQ2_XXS	2.38	37.1 GB	low
IQ2_M	2.93	45.5 GB	low
Q2_K	3.16	49.1 GB	low
IQ3_XXS	3.25	50.5 GB	low
IQ3_XS	3.5	54.3 GB	low
Q3_K_S	3.64	56.5 GB	low
IQ3_M	3.76	58.3 GB	low
Q3_K_M	4	62.0 GB	low
Q3_K_L	4.3	66.6 GB	moderate
IQ4_XS	4.46	69.1 GB	moderate
Q4_K_S	4.67	72.3 GB	moderate
Q4_K_M	4.89	75.7 GB	good
Q5_K_S	5.57	86.1 GB	good
Q5_K_M	5.7	88.1 GB	good
Q6_K	6.56	101.3 GB	excellent
Q8_0	8.5	131.2 GB	lossless
FP16	16	246.5 GB	lossless

Select your GPU above to see speed estimates and compatibility for each quantization.

▸ READY TO RUN THIS?RENT BY THE HOUR

RENT A GPU AND RUN MISTRAL-LARGE 123B NOW

Rent on RunPod →Or Vast.ai →

Spin up an A100 / H100 / 4090 in ~60s. Pay by the second. Cancel anytime.

Community Ratings

Loading ratings...

Benchmarks (11)

Arena Elo1267

IFEval84.0

BBH52.7

MMLU-PRO50.7

MATH49.5

GPQA Diamond35.1

GPQA24.9

LiveCodeBench17.8

MUSR17.2

HLE3.4

AIME0.0

Run this model

▸Easiest way to get started·Beginners

DOCS ↗

curl -fsSL https://ollama.com/install.sh | sh

$ollama run mistral:123b-q4_K_M

Downloads and runs automatically. Add --verbose for speed stats.

▸ SETUP GUIDE

Auto-setup with fitmyllm CLI

Detects your GPU, recommends the best model, downloads it, and starts chatting — zero config. Benchmarks your speed and contributes anonymous data to improve predictions.

pip install fitmyllmthen run fitmyllmLearn more

Auto-detect GPULive tok/s in chatSpeed benchmarks9 inference engines

HuggingFace Ollama Library GGUF Downloads Build Hardware

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

NVIDIA H100 SXM5 80GB

80 GB VRAM • 3350 GB/s

NVIDIA

$25000

NVIDIA H100 PCIe 80GB

80 GB VRAM • 2000 GB/s

NVIDIA

$25000

NVIDIA A100 SXM 80GB

80 GB VRAM • 2039 GB/s

NVIDIA

$10000

NVIDIA A100 PCIe 80GB

80 GB VRAM • 1935 GB/s

NVIDIA

$10000

NVIDIA A100 SXM4 80 GB

80 GB VRAM • 2040 GB/s

NVIDIA

$15000

NVIDIA A100 PCIe 80 GB

80 GB VRAM • 1940 GB/s

NVIDIA

$10000

NVIDIA A100X

80 GB VRAM • 2040 GB/s

NVIDIA

NVIDIA H100 PCIe 80 GB

80 GB VRAM • 2040 GB/s

NVIDIA

$25000

NVIDIA H100 SXM5 80 GB

80 GB VRAM • 3360 GB/s

NVIDIA

$25000

NVIDIA H100 CNX

80 GB VRAM • 2040 GB/s

NVIDIA

$25000

NVIDIA A800 PCIe 80 GB

80 GB VRAM • 1940 GB/s

NVIDIA

NVIDIA A800 SXM4 80 GB

80 GB VRAM • 2040 GB/s

NVIDIA

NVIDIA H800 PCIe 80 GB

80 GB VRAM • 2040 GB/s

NVIDIA

NVIDIA H800 SXM5

80 GB VRAM • 3360 GB/s

NVIDIA

NVIDIA RTX 6000D

84 GB VRAM • 1570 GB/s

NVIDIA

$7500

NVIDIA B200

90 GB VRAM • 4100 GB/s

NVIDIA

$30000

NVIDIA H100 NVL 94 GB

94 GB VRAM • 3940 GB/s

NVIDIA

$30000

NVIDIA H100 SXM5 94 GB

94 GB VRAM • 3360 GB/s

NVIDIA

$25000

RTX Pro 6000

96 GB VRAM • 1792 GB/s

NVIDIA

$8565

NVIDIA H100 PCIe 96 GB

96 GB VRAM • 3360 GB/s

NVIDIA

$25000

NVIDIA H100 SXM5 96 GB

96 GB VRAM • 3360 GB/s

NVIDIA

$25000

Intel Data Center GPU Max 1350

96 GB VRAM • 2460 GB/s

INTEL

NVIDIA RTX PRO 6000 Blackwell Server

96 GB VRAM • 1790 GB/s

NVIDIA

$9999

NVIDIA RTX PRO 6000 Blackwell

96 GB VRAM • 1790 GB/s

NVIDIA

$9999

AMD Instinct MI300A

120 GB VRAM • 5300 GB/s

AMD

$12000

Apple M4 Max (128GB)

128 GB VRAM • 546 GB/s

APPLE

$3999

AMD Instinct MI250X

128 GB VRAM • 3277 GB/s

AMD

$10000

Apple M1 Ultra (128GB)

128 GB VRAM • 800 GB/s

APPLE

$4999

Apple M2 Ultra (128GB)

128 GB VRAM • 800 GB/s

APPLE

$3999

AMD Radeon Instinct MI250

128 GB VRAM • 3280 GB/s

AMD

$12000

Find the best GPU for Mistral-Large 123B

Build Hardware for Mistral-Large 123B

Model Card

View on HuggingFace

Mistral-Large-Instruct-2407 is an advanced dense Large Language Model (LLM) of 123B parameters with state-of-the-art reasoning, knowledge and coding capabilities.

Mistral-Large 123B

Quantization Options

Community Ratings

Benchmarks (11)

Run this model

Auto-setup with fitmyllm CLI

GPUs that can run this model

Model Card

Mistral-Large 123B — 123B Dense.