Monday, 22 June 2026 | Updating Daily AI insight, written for builders

Llama 3.3 70B

Llama 3.3 70B — Specifications

DesarrolladorMeta
TipoLLM (dense)
ModalidadText → Text
Parámetros70B
Ventana de contexto128K
Salida máxima
LicenciaLlama 3.3 Community (open)
Pesos abiertos✅ Yes
Fecha de lanzamiento2024
Input price$0.10 /1M
Output price$0.32 /1M
API providersTogether, DeepInfra, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)~40 GB
Minimum GPU2× RTX 4090 / 1× 48GB

Official page →

Meta’s efficient 70B dense model — near-405B quality at a fraction of the size, with a 128K context. One of the most-deployed open models for self-hosting; runs 4-bit on dual-24GB or a single 48GB card.

Scroll to Top