Qwen3 30B-A3B — Specifications
| Développeur | Alibaba |
|---|
| Type | LLM (MoE) |
|---|
| Modalité | Text → Text |
|---|
| Paramètres | 30B total / 3B active (MoE) |
|---|
| Fenêtre de contexte | 128K |
|---|
| Sortie maximale | — |
|---|
| Licence | Apache 2.0 (open) |
|---|
| Poids ouverts | ✅ Yes |
|---|
| Date de sortie | 2025 |
|---|
| Input price | $0.12 /1M |
|---|
| Output price | $0.5 /1M |
|---|
| API providers | Alibaba, OpenRouter, Ollama |
|---|
🖥️ Run it locally
| VRAM (4-bit) | ~18 GB |
|---|
| Minimum GPU | RTX 4090 24GB (Q4) — fast, 3B active |
|---|
Official page →
A mixture-of-experts Qwen3 with 30B total parameters but only ~3B active per token — near-32B quality at a fraction of the compute, very fast locally. Apache 2.0, 128K context.