Qwen3 8B — Specifications
| Entwickler | Alibaba |
|---|---|
| Typ | LLM (dense) |
| Modalität | Text → Text |
| Parameter | 8B |
| Kontextfenster | 128K |
| Maximale Ausgabe | — |
| Lizenz | Apache 2.0 (open) |
| Offene Gewichte | ✅ Yes |
| Veröffentlichung | 2025 |
| Input price | $0.04 /1M |
| Output price | $0.14 /1M |
| API providers | Alibaba, OpenRouter, Ollama |
🖥️ Run it locally
| VRAM (4-bit) | ~5 GB |
|---|---|
| Minimum GPU | RTX 3060 8GB / any 8GB GPU |
A small, fast dense Qwen3 (Apache 2.0, 128K) — one of the best models that fits comfortably on an 8GB GPU at 4-bit.
