dots.llm1.inst — 142.8B Parameter Dense LLM
Model Specifications
- Parameters: 142.8B
- Architecture: Dense Transformer
- Context Length: 32K tokens
- Capabilities: chat
- Release Date: 2025-05-14
- Provider: Rednote
- Family: other
VRAM Requirements
| Quantization | BPW | VRAM | Quality |
|---|---|---|---|
| IQ2_XXS | 2.38 | 43.0 GB | 65% |
| IQ2_M | 2.93 | 52.8 GB | 75% |
| Q2_K | 3.16 | 56.9 GB | 78% |
| IQ3_XXS | 3.25 | 58.5 GB | 82% |
| IQ3_XS | 3.5 | 63.0 GB | 84% |
| Q3_K_S | 3.64 | 65.5 GB | 85% |
| IQ3_M | 3.76 | 67.6 GB | 86% |
| Q3_K_M | 4 | 71.9 GB | 88% |
| Q3_K_L | 4.3 | 77.2 GB | 90% |
| IQ4_XS | 4.46 | 80.1 GB | 92% |
| Q4_K_S | 4.67 | 83.8 GB | 93% |
| Q4_K_M | 4.89 | 87.8 GB | 94% |
| Q5_K_S | 5.57 | 99.9 GB | 96% |
| Q5_K_M | 5.7 | 102.2 GB | 96% |
| Q6_K | 6.56 | 117.6 GB | 97% |
| Q8_0 | 8.5 | 152.2 GB | 100% |
| FP16 | 16 | 286.1 GB | 100% |
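The VRAM figures above follow directly from the parameter count and the bits-per-weight (BPW) of each quantization; a minimal sketch of that arithmetic (weights only — it ignores the small runtime/KV-cache overhead, so results land slightly below the table's values):

```python
def weights_gb(params_b: float, bpw: float) -> float:
    """Estimate weight storage in GB for a model with `params_b` billion
    parameters stored at `bpw` bits per weight (8 bits per byte)."""
    return params_b * bpw / 8

# dots.llm1.inst at Q4_K_M (4.89 bpw):
print(f"{weights_gb(142.8, 4.89):.1f} GB")  # ~87.3 GB vs. 87.8 GB in the table
# At FP16 (16 bpw):
print(f"{weights_gb(142.8, 16):.1f} GB")    # ~285.6 GB vs. 286.1 GB in the table
```

The ~0.5 GB gap against the table suggests it includes a small fixed overhead on top of raw weight storage.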
How to Run dots.llm1.inst 142.8B
Run dots.llm1.inst 142.8B locally with Ollama (needs 87.8 GB of VRAM at Q4_K_M):
ollama run other:142b
Compatible GPUs (30)
GPUs that can run dots.llm1.inst 142.8B at Q4_K_M quantization:
- NVIDIA B200 (90GB, 4100 GB/s)
- NVIDIA H100 NVL 94 GB (94GB, 3940 GB/s)
- NVIDIA H100 SXM5 94 GB (94GB, 3360 GB/s)
- RTX Pro 6000 (96GB, 1792 GB/s)
- NVIDIA H100 PCIe 96 GB (96GB, 3360 GB/s)
- NVIDIA H100 SXM5 96 GB (96GB, 3360 GB/s)
- Intel Data Center GPU Max 1350 (96GB, 2460 GB/s)
- NVIDIA RTX PRO 6000 Blackwell Server (96GB, 1790 GB/s)
- NVIDIA RTX PRO 6000 Blackwell (96GB, 1790 GB/s)
- AMD Instinct MI300A (120GB, 5300 GB/s)
- Apple M4 Max (128GB) (128GB, 546 GB/s)
- AMD Instinct MI250X (128GB, 3277 GB/s)
- Apple M1 Ultra (128GB) (128GB, 800 GB/s)
- Apple M2 Ultra (128GB) (128GB, 800 GB/s)
- AMD Radeon Instinct MI250 (128GB, 3280 GB/s)
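The compatibility rule behind the list above is simply "GPU VRAM ≥ the quantization's VRAM requirement"; a sketch of that check (the short GPU list here is illustrative, not the full set of 30):

```python
# Q4_K_M requirement from the table above.
REQUIRED_GB = 87.8

# (name, VRAM in GB) — a few entries from the list plus one that fails.
gpus = [
    ("NVIDIA B200", 90),
    ("Apple M4 Max (128GB)", 128),
    ("RTX 4090", 24),  # 24 GB: far below the Q4_K_M requirement
]

# Keep only GPUs whose VRAM covers the requirement.
compatible = [name for name, vram in gpus if vram >= REQUIRED_GB]
print(compatible)  # ['NVIDIA B200', 'Apple M4 Max (128GB)']
```

Note this is a single-GPU check; multi-GPU setups that split the model across cards pool their VRAM and are not covered by this rule.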