Cogito 3B — 3B Parameter Dense LLM
Model Specifications
- Parameters
- 3B
- Architecture
- Dense Transformer
- Context Length
- 125K tokens
- Capabilities
- chat, reasoning, coding, tool_use
- Release Date
- 2025-04-01
- Family
- cogito
VRAM Requirements
| Quantization | BPW | VRAM | Quality |
|---|---|---|---|
| Q4_K_M | 4.89 | 2.3 GB | 94% |
| Q5_K_S | 5.57 | 2.6 GB | 96% |
| Q5_K_M | 5.7 | 2.6 GB | 96% |
| Q6_K | 6.56 | 2.9 GB | 97% |
| Q8_0 | 8.5 | 3.7 GB | 100% |
| FP16 | 16 | 6.5 GB | 100% |
Benchmark Scores
MMLU-PRO27.2
MATH21.8
IFEval39.3
BBH26.3
GPQA6.7
MUSR11.5
How to Run Cogito 3B
Run Cogito 3B locally with Ollama (needs 2.3 GB VRAM at Q4_K_M):
ollama run cogito:3bCompatible GPUs (30)
GPUs that can run Cogito 3B at Q4_K_M quantization:
NVIDIA Quadro 5000(3GB, 120 GB/s)NVIDIA Quadro 5000 SDI(3GB, 120 GB/s)NVIDIA Tesla C2050(3GB, 144 GB/s)NVIDIA Tesla M2050(3GB, 148 GB/s)NVIDIA Tesla S2050(3GB, 148 GB/s)NVIDIA GeForce GTX 670MX(3GB, 67 GB/s)AMD Radeon HD 7950(3GB, 240 GB/s)AMD Radeon HD 7950 Boost(3GB, 240 GB/s)AMD Radeon HD 7950 Monica BIOS 1(3GB, 240 GB/s)AMD Radeon HD 7950 Monica BIOS 2(3GB, 240 GB/s)AMD Radeon HD 7970(3GB, 264 GB/s)AMD Radeon HD 7970 GHz Edition(3GB, 288 GB/s)AMD Radeon HD 7970 X2(3GB, 264 GB/s)NVIDIA GeForce GTX 770M(3GB, 96 GB/s)NVIDIA GeForce GTX 780(3GB, 288 GB/s)