AI Observatory / Model Radar Google / google/gemini-3.1-flash-lite

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Back To Model Radar Return To Index

1,048,576 Context

65,536 Max output

$0.25 / 1M Prompt price

2026-05-07 Created

01 / Snapshot

Pricing, context, modalities, and parameters.

Model Radar detail pages stay neutral and operator-readable: core metadata first, then workflow fit.

Provider	Google	Input modalities	text, image, video, file, audio
Output modalities	text	Prompt price	$0.25 / 1M
Completion price	$1.50 / 1M	Request price	N/A
Context length	1,048,576	Max completion tokens	65,536
Supported parameters	include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p

Best for google/gemini-3.1-flash-lite

Google: Gemini 3.1 Flash Lite

Cross-modal work Voice and audio Long context High-volume usage new vision tool-capable long-context low-cost

02 / Related

Related models in nearby categories.

Related models are derived from overlapping use-case categories so the detail page stays navigable.

~Google $0.50 / 1M

Google Gemini Flash Latest

This model always redirects to the latest model in the Google Gemini Flash family.

Cross-modal work Voice and audio Long context High-volume usage

Google $0.10 / 1M

Google: Gemini 2.0 Flash

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

Cross-modal work Voice and audio Long context High-volume usage

Google $0.07 / 1M

Google: Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

Cross-modal work Voice and audio Long context High-volume usage

Google $0.30 / 1M

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Deep analysis Coding workflows Cross-modal work Voice and audio

Google $0.10 / 1M

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Deep analysis Cross-modal work Voice and audio Long context

Google $0.10 / 1M

Google: Gemini 2.5 Flash Lite Preview 09-2025

Deep analysis Cross-modal work Voice and audio Long context

03 / Colophon

Routes and exits.

Each model page stays simple: overview, compare, related models, then back to the public hub.