Monday, 22 June 2026 | Updating Daily AI insight, written for builders

Qwen3 30B-A3B

Qwen3 30B-A3B — Specifications

Developer: Alibaba
Type: LLM (MoE)
Modality: Text → Text
Parameters: 30B total / 3B active (MoE)
Context window: 128K
Max output: (not listed)
License: Apache 2.0 (open)
Open weights: ✅ Yes
Released: 2025
Input price: $0.12 / 1M tokens
Output price: $0.50 / 1M tokens
API providers: Alibaba, OpenRouter, Ollama
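
To make the listed prices concrete, here is a minimal cost-estimation sketch. The two prices are taken from the specifications above; the workload figures (request count and tokens per request) are hypothetical and only serve as an illustration. Actual provider pricing may differ.

```python
# Prices copied from the specifications above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.12
OUTPUT_PRICE_PER_M = 0.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed per-million prices."""
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


# Hypothetical workload: 10,000 requests, each with
# 2,000 input tokens and 500 output tokens.
per_request = estimate_cost_usd(2_000, 500)
total = per_request * 10_000
print(f"per request: ${per_request:.6f}, total: ${total:.2f}")
```

At these prices, output tokens cost roughly four times as much as input tokens, so long generations dominate the bill sooner than long prompts do.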

🖥️ Run it locally

VRAM (4-bit): ~18 GB
Minimum GPU: RTX 4090 24GB (Q4); fast, 3B active
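
The ~18 GB figure can be sanity-checked with back-of-the-envelope arithmetic. The sketch below assumes exactly 4 bits per weight and that all 30B parameters stay resident in VRAM (in a typical MoE setup every expert is loaded, even though only ~3B parameters are used per token). Treating the remainder as runtime overhead (KV cache, activations, quantization metadata) is an assumption, not a number from the source.

```python
def weight_memory_gb(total_params: float, bits_per_weight: float) -> float:
    """Memory needed for the weights alone, in decimal gigabytes."""
    return total_params * bits_per_weight / 8 / 1e9


TOTAL_PARAMS = 30e9      # from the spec: 30B total parameters
LISTED_VRAM_GB = 18.0    # from the spec: ~18 GB at 4-bit

weights_gb = weight_memory_gb(TOTAL_PARAMS, 4)   # 4-bit weights
overhead_gb = LISTED_VRAM_GB - weights_gb        # assumed: cache, activations, scales

print(f"weights: {weights_gb:.1f} GB, implied overhead: {overhead_gb:.1f} GB")
```

Under these assumptions the weights account for about 15 GB, which leaves a few gigabytes of headroom on a 24 GB card for longer contexts.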


A mixture-of-experts Qwen3 model with 30B total parameters, of which only ~3B are active per token. It targets quality close to a dense 32B model at a fraction of the per-token compute, which makes it very fast to run locally. Apache 2.0 license, 128K context.
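
The "fraction of the compute" claim can be roughed out numerically. The sketch below uses the common rule of thumb of about 2 FLOPs per active parameter per generated token; that rule, and the comparison against a dense 32B model, are assumptions for illustration and ignore attention cost, routing overhead, and memory bandwidth.

```python
def approx_flops_per_token(active_params: float) -> float:
    """Rule-of-thumb forward-pass cost: ~2 FLOPs per active parameter."""
    return 2 * active_params


moe_flops = approx_flops_per_token(3e9)     # ~3B active parameters per token
dense_flops = approx_flops_per_token(32e9)  # dense 32B model for comparison

ratio = moe_flops / dense_flops
print(f"MoE per-token compute is about {ratio:.1%} of a dense 32B model")
```

Note that this ratio concerns compute only: memory use still scales with the full 30B parameters.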
