Qwen3 8B — Specifications
| Developer | Alibaba |
|---|---|
| Type | LLM (dense) |
| Modality | Text → Text |
| Parameters | 8B |
| Context window | 128K |
| Max output | — |
| License | Apache 2.0 (open) |
| Open weights | ✅ Yes |
| Release | 2025 |
| Input price | $0.04 /1M |
| Output price | $0.14 /1M |
| API providers | Alibaba, OpenRouter, Ollama |
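
As a rough illustration of how the per-token prices translate into a per-request cost, the sketch below applies the two listed rates to a request's token counts. The helper name `estimate_cost` and the example token counts are hypothetical. The prices are the ones in the table and may differ between providers.

```python
# Prices per 1M tokens in USD, copied from the table above.
# Assumption: a provider bills input and output tokens linearly at these rates.
INPUT_PRICE_PER_M = 0.04
OUTPUT_PRICE_PER_M = 0.14


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 2,000-token reply.
    cost = estimate_cost(input_tokens=20_000, output_tokens=2_000)
    print(f"Estimated cost: ${cost:.6f}")
```

At these rates, one million input tokens plus one million output tokens comes to about $0.18.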
🖥️ Run it locally
| VRAM (4-bit) | ~5 GB |
|---|---|
| Minimum GPU | RTX 3060 8GB / any 8GB GPU |
A small, fast, dense Qwen3 model (Apache 2.0, 128K context). At 4-bit quantization it needs about 5 GB of VRAM, which makes it one of the best models that fit comfortably on an 8GB GPU.
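
The ~5 GB figure can be sanity-checked with simple arithmetic: 8 billion weights at 4 bits each take about 4 GB, and the rest is runtime overhead. The sketch below is a back-of-the-envelope estimate, not a measurement. The function names are hypothetical, and the 1 GB overhead (KV cache, activations, quantization metadata) is an assumption chosen to match the table; real usage depends on the runtime and the context length in use.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: int) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    # billions of params * bits per weight / 8 bits per byte = billions of bytes
    return params_billions * bits_per_weight / 8


def estimate_vram_gb(params_billions: float, bits_per_weight: int,
                     overhead_gb: float = 1.0) -> float:
    """Weights plus an assumed fixed overhead (hypothetical 1 GB default)."""
    return weight_memory_gb(params_billions, bits_per_weight) + overhead_gb


if __name__ == "__main__":
    weights = weight_memory_gb(8, 4)   # 8B parameters at 4-bit
    total = estimate_vram_gb(8, 4)     # add the assumed overhead
    print(f"Weights: {weights:.1f} GB, estimated total: {total:.1f} GB")
    print("Fits on an 8 GB GPU:", total <= 8)
```

Under these assumptions the weights take 4 GB and the total lands at 5 GB, leaving headroom on an 8GB card. The same arithmetic at 16-bit gives 16 GB for the weights alone, which is why quantization matters at this GPU size.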
