Monday, 22 June 2026 | Updating Daily AI insight, written for builders

Llama 3.1 8B

Llama 3.1 8B — Specifications

DesenvolvedorMeta
TipoLLM (dense)
ModalidadeText → Text
Parâmetros8B
Janela de contexto128K
Saída máxima
LicençaLlama 3.1 Community (open)
Pesos abertos✅ Yes
Lançamento2024
Input price$0.02 /1M
Output price$0.03 /1M
API providersTogether, DeepInfra, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)~5 GB
Minimum GPUAny 8GB GPU

Official page →

The workhorse small Llama — 8B, 128K context, runs on almost any modern GPU. One of the most widely deployed open models in production, and an excellent cheap baseline.

Scroll to Top