Llama 3.1 8B — Specifications
| Developer | Meta |
|---|---|
| Type | LLM (dense) |
| Modality | Text → Text |
| Parameters | 8B |
| Context window | 128K tokens |
| Max output | Not specified |
| License | Llama 3.1 Community (open) |
| Open weights | ✅ Yes |
| Release date | 2024 |
| Input price | $0.02 per 1M tokens |
| Output price | $0.03 per 1M tokens |
| API providers | Together, DeepInfra, OpenRouter, Ollama |
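
The per-token prices listed above translate into request costs with simple arithmetic. The following is a minimal sketch, assuming the listed prices are USD per one million tokens and that input and output tokens are billed separately; the function name `estimate_cost` and the workload figures are hypothetical and chosen only for illustration. Actual provider pricing may differ.

```python
# Prices in USD per one million tokens, as listed in the spec table.
INPUT_PRICE_PER_M = 0.02
OUTPUT_PRICE_PER_M = 0.03


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    cost_per_million = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    )
    return cost_per_million / 1_000_000


# Hypothetical workload: 10,000 requests, each with 2,000 input tokens
# and 500 output tokens.
requests = 10_000
total = requests * estimate_cost(2_000, 500)
print(f"Estimated total: ${total:.2f}")
```

At these rates, even a workload of ten thousand mid-sized requests stays under one dollar, which is why the model is attractive as a baseline.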
🖥️ Run it locally
| VRAM (4-bit) | ~5 GB |
|---|---|
| Minimum GPU | Any 8 GB GPU |
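
The VRAM figure can be sanity-checked with a back-of-the-envelope calculation. This is a rough sketch, assuming exactly 8 billion parameters and counting weights only; the helper `estimate_weight_gb` is hypothetical. The gap between the weights-only result and the ~5 GB listed above is presumably taken up by quantization metadata, the KV cache, and runtime buffers, though the source does not break this down.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Weights-only memory in GB (1 GB = 1e9 bytes).

    params_billion * 1e9 weights, each bits_per_weight / 8 bytes,
    divided by 1e9 bytes per GB. The 1e9 factors cancel.
    """
    return params_billion * bits_per_weight / 8


# Assumption: a round 8B parameter count, as listed in the spec table.
weights_4bit = estimate_weight_gb(8, 4)
weights_fp16 = estimate_weight_gb(8, 16)

print(f"4-bit weights: {weights_4bit:.1f} GB")
print(f"16-bit weights: {weights_fp16:.1f} GB")
```

The 16-bit figure shows why quantization matters here: unquantized weights alone would not fit on an 8 GB card.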
The workhorse small Llama: 8B parameters, a 128K context window, and a footprint of about 5 GB at 4-bit quantization, so it runs on almost any modern GPU. It is one of the most widely deployed open models in production and an excellent cheap baseline.