DeepSeek R1 — Specifications
| Developer | DeepSeek |
|---|
| Type | LLM (MoE, reasoning) |
|---|
| Modality | Text → Text |
|---|
| Parameters | 671B total / 37B active (MoE) |
|---|
| Context window | 128K |
|---|
| Max output | — |
|---|
| License | MIT (open) |
|---|
| Open weights | ✅ Yes |
|---|
| Released | 2025 |
|---|
| Input price | $0.50 /1M |
|---|
| Output price | $2.15 /1M |
|---|
| API providers | DeepSeek, DeepInfra, OpenRouter |
|---|
🖥️ Run it locally
| VRAM (4-bit) | ~400 GB |
|---|
| Minimum GPU | Multi-GPU server |
|---|
Official page →
DeepSeek’s landmark open reasoning model — a 671B mixture-of-experts (37B active), MIT-licensed, delivering frontier-class chain-of-thought reasoning at a fraction of proprietary cost. Server-class to self-host, but openly downloadable.