01.AI/Dense

Yi-1.5 6B

chat

6.06B

Parameters

Context length

Benchmarks

Quantizations

80K

HF downloads

Architecture

Dense

Released

2024-05-13

Layers

KV Heads

Head Dim

128

Family

Model Card

View on HuggingFace

<div align="center"> <picture> <img src="https://raw.githubusercontent.com/01-ai/Yi/main/assets/img/Yi_logo_icon_light.svg" width="150px"> </picture> </div> <p align="center"> <a href="https://github.com/01-ai">🐙 GitHub</a> • <a href="https://discord.gg/hYUwWddeAu">👾 Discord</a> • <a href="https://twitter.com/01ai_yi">🐤 Twitter</a> • <a href="https://github.com/01-ai/Yi-1.5/issues/2">💬 WeChat</a> <br/> <a href="https://arxiv.org/abs/2403.04652">📝 Paper</a> • <a href="https://01-ai.github.io/">💪 Tech Blog</a> • <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#faq">🙌 FAQ</a> • <a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#learning-hub">📗 Learning Hub</a> </p>

Intro

Yi-1.5 is an upgraded version of Yi. It is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples.

Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.

Name	Download
Yi-1.5-34B-Chat	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-34B-Chat-16K	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-9B-Chat	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-9B-Chat-16K	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-6B-Chat	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel

Name	Download
Yi-1.5-34B	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-34B-32K	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-9B	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-9B-32K	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel
Yi-1.5-6B	• 🤗 Hugging Face • 🤖 ModelScope • 🟣 wisemodel

Yi-1.5 6B

Model Card

Intro

Models

Benchmarks

Quick Start

Quantizations & VRAM

Benchmarks (8)

GPUs that can run this model