DeepSeek R1 Distill Llama 70B — Specifications
| Développeur | DeepSeek |
|---|
| Type | LLM (dense, reasoning) |
|---|
| Modalité | Text → Text |
|---|
| Paramètres | 70 milliards |
|---|
| Fenêtre de contexte | 128K |
|---|
| Sortie maximale | — |
|---|
| Licence | MIT (open) |
|---|
| Poids ouverts | ✅ Yes |
|---|
| Date de sortie | 2025 |
|---|
| Input price | $0.80 /1M |
|---|
| Output price | $0.80 /1M |
|---|
| API providers | DeepInfra, OpenRouter, Ollama |
|---|
🖥️ Run it locally
| VRAM (4-bit) | ~40 GB |
|---|
| Minimum GPU | 2× RTX 4090 / 1× 48GB |
|---|
Official page →
R1’s reasoning distilled into a 70B Llama base — brings much of DeepSeek R1’s step-by-step reasoning to hardware you can actually self-host. MIT-licensed, 128K context.