TII/Dense

Falcon-H1R 7B

Name: Falcon-H1R 7B
Author: TII

Reasoning variant with extended thinking. Beats 50B+ models on math/code with just 7B params. 256K context.

chatcodingreasoning

Parameters

256K

Context length

Benchmarks

Quantizations

Architecture

Dense

Released

2026-01-07

Layers

KV Heads

Head Dim

128

Family

falcon

Quantization Options

Quant	Bits	VRAM	Quality
Q4_K_M	4.89	4.8 GB	good
Q5_K_S	5.57	5.4 GB	good
Q5_K_M	5.7	5.5 GB	good
Q6_K	6.56	6.2 GB	excellent
Q8_0	8.5	7.9 GB	lossless
FP16	16	14.5 GB	lossless

Select your GPU above to see speed estimates and compatibility for each quantization.

▸ READY TO RUN THIS?RENT BY THE HOUR

RENT A GPU AND RUN FALCON-H1R 7B NOW

Rent on RunPod →Or Vast.ai →

Spin up an A100 / H100 / 4090 in ~60s. Pay by the second. Cancel anytime.

Community Ratings

Loading ratings...

Benchmarks (18)

MATH97.4

AIME83.1

AA Math80.0

IFEval76.1

LiveCodeBench72.4

MMLU-PRO72.1

GPQA Diamond61.3

IFBench54.4

BBH37.9

τ²-Bench27.8

SciCode24.9

MUSR21.2

AA Intelligence15.8

HLE10.8

AA Coding9.8

AA Long Context8.7

GPQA8.1

Terminal-Bench2.3

Run this model

▸Easiest way to get started·Beginners

DOCS ↗

curl -fsSL https://ollama.com/install.sh | sh

$ollama run falcon:7b-q4_K_M

Tag may need adjustment — check ollama.com/library/falcon for available tags.

▸ SETUP GUIDE

Auto-setup with fitmyllm CLI

Detects your GPU, recommends the best model, downloads it, and starts chatting — zero config. Benchmarks your speed and contributes anonymous data to improve predictions.

pip install fitmyllmthen run fitmyllmLearn more

Auto-detect GPULive tok/s in chatSpeed benchmarks9 inference engines

HuggingFace GGUF Downloads Build Hardware

GPUs that can run this model

At Q4_K_M quantization. Sorted by minimum VRAM.

NVIDIA Tesla K20c

5 GB VRAM • 208 GB/s

NVIDIA

NVIDIA Tesla K20m

5 GB VRAM • 208 GB/s

NVIDIA

NVIDIA Tesla K20s

5 GB VRAM • 208 GB/s

NVIDIA

NVIDIA GeForce GTX 1060 5 GB

5 GB VRAM • 160 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA P102-100

5 GB VRAM • 440 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA Quadro P2000

5 GB VRAM • 140.2 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA Quadro P2200

5 GB VRAM • 200.2 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA RTX 3050 6GB

6 GB VRAM • 168 GB/s

NVIDIA

$169

Amazon Newegg eBay PCPartPicker

Intel Arc A380

6 GB VRAM • 186 GB/s

INTEL

$129

Amazon Newegg eBay PCPartPicker

NVIDIA RTX 2060 6GB

6 GB VRAM • 336 GB/s

NVIDIA

$150

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1660 SUPER

6 GB VRAM • 336 GB/s

NVIDIA

$150

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1660 Ti

6 GB VRAM • 288 GB/s

NVIDIA

$140

Amazon Newegg eBay PCPartPicker

NVIDIA GTX 1060 6GB

6 GB VRAM • 192 GB/s

NVIDIA

$80

Amazon Newegg eBay PCPartPicker

NVIDIA Tesla C2070

6 GB VRAM • 143 GB/s

NVIDIA

NVIDIA Tesla C2075

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla C2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla M2070

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2070-Q

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2075

6 GB VRAM • 150 GB/s

NVIDIA

NVIDIA Tesla M2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla X2070

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla X2090

6 GB VRAM • 177 GB/s

NVIDIA

NVIDIA Tesla K20X

6 GB VRAM • 250 GB/s

NVIDIA

NVIDIA Tesla K20Xm

6 GB VRAM • 250 GB/s

NVIDIA

NVIDIA GeForce GTX 1060 6 GB

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB 9Gbps

6 GB VRAM • 217 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB GDDR5X

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB GP104

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1060 6 GB Rev. 2

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

NVIDIA GeForce GTX 1660

6 GB VRAM • 192 GB/s

NVIDIA

Amazon Newegg eBay PCPartPicker

Find the best GPU for Falcon-H1R 7B

Build Hardware for Falcon-H1R 7B

Falcon-H1R 7B

Quantization Options

Community Ratings

Benchmarks (18)

Run this model

Auto-setup with fitmyllm CLI

GPUs that can run this model

Falcon-H1R 7B — 7B Dense.