AI Observatory / Model Radar DeepSeek / deepseek/deepseek-r1-distill-llama-70b

DeepSeek: R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

Back To Model Radar Return To Index

131,072 Context

16,384 Max output

$0.70 / 1M Prompt price

2025-01-23 Created

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	DeepSeek	Input modalities	text
Output modalities	text	Prompt price	$0.70 / 1M
Completion price	$0.80 / 1M	Request price	N/A
Context length	131,072	Max completion tokens	16,384
Supported parameters	frequency_penalty, include_reasoning, logit_bias, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, temperature, top_k, top_p

Best for deepseek/deepseek-r1-distill-llama-70b

DeepSeek: R1 Distill Llama 70B

Deep analysis High-volume usage low-cost

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

Aion Labs $0.70 / 1M

AionLabs: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

Deep analysis Coding workflows High-volume usage

Allenai $0.15 / 1M

AllenAI: Olmo 3 32B Think

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...

Deep analysis High-volume usage

Amazon $0.30 / 1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

Deep analysis Cross-modal work Long context High-volume usage

Arcee Ai $0.90 / 1M

Arcee AI: Maestro Reasoning

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B...

Deep analysis High-volume usage

Arcee Ai $0.22 / 1M

Arcee AI: Trinity Large Thinking

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7

Deep analysis Long context High-volume usage Coding workflows

Arcee Ai $0.04 / 1M

Arcee AI: Trinity Mini

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...

Deep analysis High-volume usage Coding workflows

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.