Xiaomi: MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

Context: 262,144

Max output: 65,536

Prompt price: $0.40 / 1M

Created: 2026-03-18

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	Xiaomi
Input modalities	text, audio, image, video
Output modalities	text
Prompt price	$0.40 / 1M
Completion price	$2.00 / 1M
Request price	N/A
Context length	262,144
Max completion tokens	65,536
Supported parameters	frequency_penalty, include_reasoning, max_tokens, presence_penalty, reasoning, response_format, stop, temperature, tool_choice, tools, top_p
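
The pricing and limit figures above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator: it assumes the "/ 1M" prices are USD per million tokens, that prompt and completion tokens share the context window, and that token counts are already known. The page does not say how audio, image, or video inputs are counted, so that is left out. The function names are hypothetical.

```python
# Figures copied from the snapshot table; helper names are illustrative only.
PROMPT_USD_PER_M = 0.40        # assumed to mean USD per 1M prompt tokens
COMPLETION_USD_PER_M = 2.00    # assumed to mean USD per 1M completion tokens
CONTEXT_LENGTH = 262_144
MAX_COMPLETION_TOKENS = 65_536


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = (prompt_tokens * PROMPT_USD_PER_M
             + completion_tokens * COMPLETION_USD_PER_M)
    return total / 1_000_000


def fits_limits(prompt_tokens: int, completion_tokens: int) -> bool:
    """Check a request against the listed limits.

    Assumption: prompt and completion together must fit in the context window.
    """
    return (completion_tokens <= MAX_COMPLETION_TOKENS
            and prompt_tokens + completion_tokens <= CONTEXT_LENGTH)


if __name__ == "__main__":
    print(f"{estimate_cost_usd(200_000, 4_000):.4f}")  # prints 0.0880
    print(fits_limits(200_000, 4_000))                 # prints True
```

Because the completion price is five times the prompt price, long outputs dominate cost much faster than long inputs do.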

Best for xiaomi/mimo-v2-omni

Use cases: Cross-modal work, Voice and audio, Long context, High-volume usage

Tags: new, vision, tool-capable, long-context, low-cost

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

~Google $0.50 / 1M

Google Gemini Flash Latest

This model always redirects to the latest model in the Google Gemini Flash family.

Cross-modal work, Voice and audio, Long context, High-volume usage

Google $0.10 / 1M

Google: Gemini 2.0 Flash

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It...

Cross-modal work, Voice and audio, Long context, High-volume usage

Google $0.07 / 1M

Google: Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5,...

Cross-modal work, Voice and audio, Long context, High-volume usage

Google $0.30 / 1M

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Deep analysis, Coding workflows, Cross-modal work, Voice and audio

Google $0.10 / 1M

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Deep analysis, Cross-modal work, Voice and audio, Long context

Google $0.10 / 1M

Google: Gemini 2.5 Flash Lite Preview 09-2025

Deep analysis, Cross-modal work, Voice and audio, Long context
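
The section intro says related models are derived from overlapping use-case categories. One way that could work is sketched below, using the categories listed on this page. The scoring rule (count of shared categories, highest first) and the function name are assumptions; the page does not describe the actual ranking, and the order shown above does not strictly follow this count.

```python
# Use-case categories as listed on this page for the target and related models.
TARGET = {"Cross-modal work", "Voice and audio", "Long context", "High-volume usage"}

FLASH_SET = {"Cross-modal work", "Voice and audio", "Long context", "High-volume usage"}
LITE_25_SET = {"Deep analysis", "Cross-modal work", "Voice and audio", "Long context"}

CANDIDATES = {
    "Google Gemini Flash Latest": FLASH_SET,
    "Google: Gemini 2.0 Flash": FLASH_SET,
    "Google: Gemini 2.0 Flash Lite": FLASH_SET,
    "Google: Gemini 2.5 Flash": {
        "Deep analysis", "Coding workflows", "Cross-modal work", "Voice and audio",
    },
    "Google: Gemini 2.5 Flash Lite": LITE_25_SET,
    "Google: Gemini 2.5 Flash Lite Preview 09-2025": LITE_25_SET,
}


def rank_related(target: set[str], candidates: dict[str, set[str]]) -> list[tuple[str, int]]:
    """Return (model, shared-category count) pairs, most overlap first.

    Hypothetical rule: models with no shared category are dropped; ties keep
    their input order because Python's sort is stable.
    """
    scored = [(name, len(target & cats)) for name, cats in candidates.items()]
    scored = [pair for pair in scored if pair[1] > 0]
    return sorted(scored, key=lambda pair: -pair[1])


if __name__ == "__main__":
    for name, shared in rank_related(TARGET, CANDIDATES):
        print(f"{shared}  {name}")
```

Under this rule the three models sharing all four categories rank first, and Gemini 2.5 Flash, with two shared categories, ranks last.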

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.