AI Observatory / Model Radar
Inclusionai / inclusionai/ling-2.6-flash
inclusionAI: Ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
262,144
Context
32,768
Max output
$0.08 / 1M
Prompt price
2026-04-21
Created
01 / Snapshot
Pricing, context, modalities, and parameters.
Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.
| Provider | Inclusionai | Input modalities | text |
|---|---|---|---|
| Output modalities | text | Prompt price | $0.08 / 1M |
| Completion price | $0.24 / 1M | Request price | N/A |
| Context length | 262,144 | Max completion tokens | 32,768 |
| Supported parameters | frequency_penalty, max_tokens, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p | ||
Best for
inclusionai/ling-2.6-flash
inclusionAI: Ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Long context
High-volume usage
Coding workflows
new
tool-capable
long-context
low-cost
03 / Colophon
Routes and exits.
Each model page stays simple: overview, compare, related models, then back to the public hub.