AI Observatory / Model Radar Qwen / qwen/qwen-2.5-72b-instruct

Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

32,768 Context
16,384 Max output
$0.36 / 1M Prompt price
2024-09-19 Created
01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider Qwen Input modalities text
Output modalities text Prompt price $0.36 / 1M
Completion price $0.40 / 1M Request price N/A
Context length 32,768 Max completion tokens 16,384
Supported parameters frequency_penalty, logit_bias, max_tokens, min_p, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p
Best for qwen/qwen-2.5-72b-instruct

Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Coding workflows High-volume usage tool-capable low-cost
Compare 1 page

Google: Gemini 2.5 Flash vs Qwen2.5 72B Instruct

Google: Gemini 2.5 Flash and Qwen2.5 72B Instruct are both tracked in Model Radar. This page focuses on fast public models for high-volume usage. Google: Gemini 2.5 Flash exposes the larger context window. Google: Gemini 2.5 Flash is cheaper on prompt input pricing.

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.