Name: Qwen3 30B-A3B
Price: 0.12 USD
Author: Alibaba

Qwen3 30B-A3B — Specifications

Developer	Alibaba
Type	LLM (MoE)
Modality	Text → Text
Parameters	30B total / 3B active (MoE)
Context window	128K
Max output	—
License	Apache 2.0 (open)
Open weights	✅ Yes
Released	2025
Input price	$0.12 /1M
Output price	$0.5 /1M
API providers	Alibaba, OpenRouter, Ollama

🖥️ Run it locally

VRAM (4-bit)	~18 GB
Minimum GPU	RTX 4090 24GB (Q4) — fast, 3B active

Official page →

A mixture-of-experts Qwen3 with 30B total parameters but only ~3B active per token — near-32B quality at a fraction of the compute, very fast locally. Apache 2.0, 128K context.