Name: Qwen3 30B-A3B
Price: 0.12 USD
Author: Alibaba

Qwen3 30B-A3B — Specifications

Sviluppatore	Alibaba
Tipo	LLM (MoE)
Modalità	Text → Text
Parametri	30B total / 3B active (MoE)
Finestra contestuale	128K
Output massimo	—
Licenza	Apache 2.0 (open)
Pesi aperti	✅ Yes
Rilasciato	2025
Input price	$0.12 /1M
Output price	$0.5 /1M
API providers	Alibaba, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)	~18 GB
Minimum GPU	RTX 4090 24GB (Q4) — fast, 3B active

Official page →

A mixture-of-experts Qwen3 with 30B total parameters but only ~3B active per token — near-32B quality at a fraction of the compute, very fast locally. Apache 2.0, 128K context.