Monday, 22 June 2026 | Updating Daily AI insight, written for builders

Llama 3.1 8B

Llama 3.1 8B — Specifications

DeveloperMeta
TypeLLM (dense)
ModalityText → Text
Parameters8B
Context window128K
Max output
LicenseLlama 3.1 Community (open)
Open weights✅ Yes
Released2024
Input price$0.02 /1M
Output price$0.03 /1M
API providersTogether, DeepInfra, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)~5 GB
Minimum GPUAny 8GB GPU

Official page →

The workhorse small Llama — 8B, 128K context, runs on almost any modern GPU. One of the most widely deployed open models in production, and an excellent cheap baseline.

Scroll to Top