bge-large-en-v1.5 335M: 0.335B Parameter Dense Embedding Model
Model Specifications
- Parameters: 0.335B
- Architecture: Dense Transformer
- Context Length: 512 tokens
- Capabilities: text embedding
- Release Date: 2024-02-10
- Provider: BAAI
- Family: embedding
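Because this model belongs to the embedding family, its output is a vector per input text, and texts are typically compared by the cosine similarity of their vectors. The following is a minimal sketch of that comparison step only. The toy three-dimensional vectors and the helper names `cosine_similarity` and `rank_by_similarity` are illustrative assumptions, not part of any particular runtime's API; real vectors would come from the model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, documents):
    """Return (index, score) pairs, most similar document first."""
    scores = [(i, cosine_similarity(query, d)) for i, d in enumerate(documents)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy vectors standing in for real embeddings (assumption for illustration).
    query = [1.0, 0.0, 0.0]
    documents = [[0.0, 1.0, 0.0], [0.9, 0.1, 0.0], [-1.0, 0.0, 0.0]]
    for index, score in rank_by_similarity(query, documents):
        print(f"document {index}: {score:.3f}")
```

Ranking by cosine similarity ignores vector length, so only the direction of each embedding matters for the ordering.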
VRAM Requirements
| Quantization | Bits per weight (BPW) | VRAM | Quality |
|---|---|---|---|
| Q4_K_M | 4.89 | 0.7 GB | 94% |
| Q5_K_S | 5.57 | 0.7 GB | 96% |
| Q5_K_M | 5.7 | 0.7 GB | 96% |
| Q6_K | 6.56 | 0.8 GB | 97% |
| Q8_0 | 8.5 | 0.8 GB | 100% |
| FP16 | 16 | 1.2 GB | 100% |
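The bits-per-weight column can be turned into a rough weights-only size using parameters × BPW / 8 bytes. The sketch below applies that formula to the table's values. It is an estimate under that single assumption; the VRAM column is larger than these figures, presumably because it also covers runtime overhead such as activations and buffers, which the table does not break down.

```python
# Parameter count from the specifications (0.335B).
PARAMS = 0.335e9

# Bits per weight for each quantization, copied from the table.
BPW = {
    "Q4_K_M": 4.89,
    "Q5_K_S": 5.57,
    "Q5_K_M": 5.7,
    "Q6_K": 6.56,
    "Q8_0": 8.5,
    "FP16": 16,
}


def weight_size_gb(params, bpw):
    """Weights-only size in GB (10^9 bytes): params * bits / 8 bits per byte."""
    return params * bpw / 8 / 1e9


if __name__ == "__main__":
    for name, bpw in BPW.items():
        print(f"{name}: about {weight_size_gb(PARAMS, bpw):.2f} GB of weights")
```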
Benchmark Scores
MMLU-PRO: 62.3
How to Run bge-large-en-v1.5 335M
Run bge-large-en-v1.5 335M locally with Ollama (needs 0.7 GB VRAM at Q4_K_M):
ollama run embedding:0b
Compatible GPUs (30)
GPUs that can run bge-large-en-v1.5 335M at Q4_K_M quantization:
- AMD FireGL V7350 (1 GB, 41.6 GB/s)
- NVIDIA Quadro FX 5500 (1 GB, 32.3 GB/s)
- NVIDIA Quadro FX 5500 SDI (1 GB, 32.3 GB/s)
- AMD Stream Processor (1 GB, 41.5 GB/s)
- AMD FireGL V8600 (1 GB, 111.1 GB/s)
- NVIDIA GeForce 9600 GT Mac Edition (1 GB, 17 GB/s)
- NVIDIA GeForce 9600M GS (1 GB, 25.6 GB/s)
- NVIDIA GeForce 9650M GT (1 GB, 25.6 GB/s)
- NVIDIA GeForce 9800M GTS (1 GB, 51.2 GB/s)
- NVIDIA GeForce 9800M GTX (1 GB, 51.2 GB/s)
- NVIDIA GeForce GTX 280 (1 GB, 141.7 GB/s)
- NVIDIA GeForce GTX 285 (1 GB, 159 GB/s)
- NVIDIA Quadro FX 3700M (1 GB, 51.2 GB/s)
- NVIDIA Quadro FX 3800M (1 GB, 64 GB/s)
- NVIDIA Quadro FX 4700 X2 (1 GB, 51.2 GB/s)
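The compatibility list amounts to a capacity check: a GPU qualifies when its memory is at least the 0.7 GB needed at Q4_K_M. A minimal sketch of that check follows, using a few entries from the list plus one hypothetical 0.5 GB card added only to show the filter rejecting something. The `Gpu` type and `compatible` function are illustrative names, and the check covers memory capacity only, not driver or runtime support.

```python
from typing import NamedTuple


class Gpu(NamedTuple):
    name: str
    vram_gb: float
    bandwidth_gbs: float


# VRAM needed at Q4_K_M, from the table.
REQUIRED_VRAM_GB = 0.7

GPUS = [
    Gpu("AMD FireGL V8600", 1.0, 111.1),
    Gpu("NVIDIA GeForce GTX 285", 1.0, 159.0),
    Gpu("NVIDIA GeForce 9600 GT Mac Edition", 1.0, 17.0),
    Gpu("NVIDIA Quadro FX 3800M", 1.0, 64.0),
    # Hypothetical entry, not from the list, used to exercise the filter.
    Gpu("Hypothetical 512 MB card", 0.5, 20.0),
]


def compatible(gpus, required_gb):
    """Keep GPUs with enough VRAM, highest memory bandwidth first."""
    fitting = [g for g in gpus if g.vram_gb >= required_gb]
    return sorted(fitting, key=lambda g: g.bandwidth_gbs, reverse=True)


if __name__ == "__main__":
    for gpu in compatible(GPUS, REQUIRED_VRAM_GB):
        print(f"{gpu.name}: {gpu.vram_gb:g} GB, {gpu.bandwidth_gbs:g} GB/s")
```

Sorting by bandwidth is a design choice for readability; it puts the cards most likely to run the model quickly at the top.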