AI Observatory / Model Radar Qwen / qwen/qwen-2.5-72b-instruct

Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Back To Model Radar Open Compare

32,768 Context

16,384 Max output

$0.36 / 1M Prompt price

2024-09-19 Created

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	Qwen	Input modalities	text
Output modalities	text	Prompt price	$0.36 / 1M
Completion price	$0.40 / 1M	Request price	N/A
Context length	32,768	Max completion tokens	16,384
Supported parameters	frequency_penalty, logit_bias, max_tokens, min_p, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p

Best for qwen/qwen-2.5-72b-instruct

Qwen2.5 72B Instruct

Coding workflows High-volume usage tool-capable low-cost

Compare 1 page

Google: Gemini 2.5 Flash vs Qwen2.5 72B Instruct

Google: Gemini 2.5 Flash and Qwen2.5 72B Instruct are both tracked in Model Radar. This page focuses on fast public models for high-volume usage. Google: Gemini 2.5 Flash exposes the larger context window. Google: Gemini 2.5 Flash is cheaper on prompt input pricing.

Open Compare

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

Aion Labs $0.70 / 1M

AionLabs: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

Deep analysis Coding workflows High-volume usage

Alfredpros $0.80 / 1M

AlfredPros: CodeLLaMa 7B Instruct Solidity

A finetuned 7 billion parameters Code LLaMA - Instruct model to generate Solidity smart contract using 4-bit QLoRA finetuning provided by PEFT library.

Coding workflows High-volume usage

Amazon $0.30 / 1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

Deep analysis Cross-modal work Long context High-volume usage

Amazon $0.06 / 1M

Amazon: Nova Lite 1.0

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

Cross-modal work Long context High-volume usage Coding workflows

Amazon $0.04 / 1M

Amazon: Nova Micro 1.0

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

High-volume usage Coding workflows

Amazon $0.80 / 1M

Amazon: Nova Pro 1.0

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

Cross-modal work Long context High-volume usage Coding workflows

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.