Mistral AI/Dense

Ministral-8B

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

chattool_useTool Use

8B

Parameters

32K

Context length

7

Benchmarks

6

Quantizations

0

Architecture

Dense

Released

2024-09-18

Layers

36

KV Heads

8

Head Dim

128

Family

mistral

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M	4.89	5.4 GB	good
Q5_K_S	5.57	6.1 GB	good
Q5_K_M	5.7	6.2 GB	good
Q6_K	6.56	7.0 GB	excellent
Q8_0	8.5	9.0 GB	lossless
FP16	16	16.5 GB	lossless

Select your GPU above to see speed estimates and compatibility for each quantization.

▸ READY TO RUN THIS?RENT BY THE HOUR

RENT A GPU AND RUN MINISTRAL-8B NOW

Rent on RunPod →Or Vast.ai →

Spin up an A100 / H100 / 4090 in ~60s. Pay by the second. Cancel anytime.

Community Ratings

Loading ratings...

Benchmarks (7)

IFEval54.7

BBH25.6

MMLU-PRO23.1

BigCodeBench19.5

MUSR4.3

MATH3.9

GPQA3.9

Run this model

▸Easiest way to get started·Beginners

curl -fsSL https://ollama.com/install.sh | sh

$ollama run mistral:8b-q4_K_M

Downloads and runs automatically. Add --verbose for speed stats.

▸ SETUP GUIDE

>_

Auto-setup with fitmyllm CLI

Detects your GPU, recommends the best model, downloads it, and starts chatting — zero config. Benchmarks your speed and contributes anonymous data to improve predictions.

pip install fitmyllmthen run fitmyllmLearn more

Auto-detect GPULive tok/s in chatSpeed benchmarks9 inference engines

HuggingFace Ollama Library GGUF Downloads Build Hardware

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

NVIDIA RTX 3050 6GB

6 GB VRAM • 168 GB/s

NVIDIA

$169

Amazon Newegg eBay PCPartPicker

6 GB VRAM • 186 GB/s

INTEL

$129

Amazon Newegg eBay PCPartPicker

NVIDIA RTX 2060 6GB

6 GB VRAM • 336 GB/s

NVIDIA

$150

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1660 SUPER

6 GB VRAM • 336 GB/s

NVIDIA

$150

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1660 Ti

6 GB VRAM • 288 GB/s

NVIDIA

$140

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1060 6GB

6 GB VRAM • 192 GB/s

NVIDIA

$80

Amazon Newegg eBay PCPartPicker

NVIDIA Tesla C2070

6 GB VRAM • 143 GB/s

NVIDIA

NVIDIA Tesla C2075

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla C2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla M2070

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2070-Q

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2075

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla X2070

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla X2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla K20X

6 GB VRAM • 250 GB/s

NVIDIA

NVIDIA Tesla K20Xm

6 GB VRAM • 250 GB/s

NVIDIA

NVIDIA GeForce GTX 1060 6 GB

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB 9Gbps

6 GB VRAM • 217 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB GDDR5X

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB GP104

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB Rev. 2

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1660

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1660 SUPER

6 GB VRAM • 336 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1660 Ti

6 GB VRAM • 288 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce RTX 2060

6 GB VRAM • 336 GB/s

NVIDIA

$140

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce RTX 2060 TU104

6 GB VRAM • 336 GB/s

NVIDIA

$140

Amazon Newegg eBay PCPartPicker

AMD Radeon RX 5600 OEM

6 GB VRAM • 288 GB/s

AMD

Amazon Newegg eBay PCPartPicker

AMD Radeon RX 5600 XT

6 GB VRAM • 288 GB/s

AMD

$90

Amazon Newegg eBay PCPartPicker

AMD Radeon RX 5600M

6 GB VRAM • 288 GB/s

AMD

Amazon Newegg eBay PCPartPicker

Find the best GPU for Ministral-8B

Build Hardware for Ministral-8B

Model Card

View on HuggingFace

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

▸ SPEC SHEET

Ministral-8B — 8B Dense.

▸ SPECIFICATIONS

PARAMETERS: 8B
ARCHITECTURE: Dense Transformer
CONTEXT LENGTH: 32K tokens
CAPABILITIES: chat, tool_use
RELEASE DATE: 2024-09-18
PROVIDER: Mistral AI
FAMILY: mistral

▸ VRAM REQUIREMENTS

QUANT	BPW	VRAM	QUALITY
Q4_K_M	4.89	5.4 GB	94%
Q5_K_S	5.57	6.1 GB	96%
Q5_K_M	5.7	6.2 GB	96%
Q6_K	6.56	7.0 GB	97%
Q8_0	8.5	9.0 GB	100%
FP16	16	16.5 GB	100%

§ 01BENCHMARK SCORES

MMLU-PRO23.1

MATH3.9

IFEval54.7

BBH25.6

GPQA3.9

MUSR4.3

BigCodeBench19.5

§ 02RUN COMMAND

Run Ministral-8B locally with Ollama — needs 5.4 GB VRAM at Q4_K_M:

$ollama run mistral:8b

§ 03COMPATIBLE GPUs

30 @ Q4_K_M

NVIDIA RTX 3050 6GB

6 GB · 168 GB/s

6 GB · 186 GB/s

NVIDIA RTX 2060 6GB

6 GB · 336 GB/s

NVIDIA GTX 1660 SUPER

6 GB · 336 GB/s

NVIDIA GTX 1660 Ti

6 GB · 288 GB/s

NVIDIA GTX 1060 6GB

6 GB · 192 GB/s

NVIDIA Tesla C2070

6 GB · 143 GB/s

NVIDIA Tesla C2075

6 GB · 150 GB/s

NVIDIA Tesla C2090

6 GB · 177 GB/s

NVIDIA Tesla M2070

6 GB · 150 GB/s

NVIDIA Tesla M2070-Q

6 GB · 150 GB/s

NVIDIA Tesla M2075

6 GB · 150 GB/s

NVIDIA Tesla M2090

6 GB · 177 GB/s

NVIDIA Tesla X2070

6 GB · 177 GB/s

NVIDIA Tesla X2090

6 GB · 177 GB/s