Monday, 22 June 2026 | Updating Daily AI insight, written for builders

Llama 3.1 8B

Llama 3.1 8B — Specifications

DesarrolladorMeta
TipoLLM (dense)
ModalidadText → Text
Parámetros8B
Ventana de contexto128K
Salida máxima
LicenciaLlama 3.1 Community (open)
Pesos abiertos✅ Yes
Fecha de lanzamiento2024
Input price$0.02 /1M
Output price$0.03 /1M
API providersTogether, DeepInfra, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)~5 GB
Minimum GPUAny 8GB GPU

Official page →

The workhorse small Llama — 8B, 128K context, runs on almost any modern GPU. One of the most widely deployed open models in production, and an excellent cheap baseline.

Scroll to Top