Qwen3 32B — Specifications
| Desenvolvedor | Alibaba |
|---|
| Tipo | LLM (dense) |
|---|
| Modalidade | Text → Text |
|---|
| Parâmetros | 32B |
|---|
| Janela de contexto | 128K |
|---|
| Saída máxima | — |
|---|
| Licença | Apache 2.0 (open) |
|---|
| Pesos abertos | ✅ Yes |
|---|
| Lançamento | 2025 |
|---|
| Input price | $0.08 /1M |
|---|
| Output price | $0.28 /1M |
|---|
| API providers | Alibaba, OpenRouter, Ollama |
|---|
🖥️ Run it locally
| VRAM (4-bit) | ~20 GB |
|---|
| Minimum GPU | RTX 4090 24GB (Q4) |
|---|
Official page →
Alibaba’s largest dense Qwen3 — strong general reasoning and coding under a fully permissive Apache 2.0 license, with a 128K context. Comfortably runs 4-bit on a 24GB consumer GPU.