AI Observatory / Model Radar Qwen / qwen/qwen3.5-flash-02-23

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

1,000,000 Context
65,536 Max output
$0.07 / 1M Prompt price
2026-02-25 Created
01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider Qwen Input modalities text, image, video
Output modalities text Prompt price $0.07 / 1M
Completion price $0.26 / 1M Request price N/A
Context length 1,000,000 Max completion tokens 65,536
Supported parameters include_reasoning, max_tokens, presence_penalty, reasoning, response_format, seed, structured_outputs, temperature, tool_choice, tools, top_p
Best for qwen/qwen3.5-flash-02-23

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Cross-modal work Long context High-volume usage Coding workflows new vision tool-capable long-context low-cost
03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.