Gemini 2.5 Flash-Lite

Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 2.5 Flash-Lite is best for high-volume classification, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.

gemini-2.5-flash-lite

Property Description
Model code gemini-2.5-flash-lite
Supported data types

Inputs

Text, image, video, audio, PDF

Output

Text

Token limits[*]

Input token limit

1,048,576

Output token limit

65,536

Capabilities

Audio generation

Not supported

Batch API

Supported

Caching

Supported

Code execution

Supported

File search

Supported

Function calling

Supported

Grounding with Google Maps

Supported

Image generation

Not supported

Live API

Not supported

Search grounding

Supported

Structured outputs

Supported

Thinking

Supported

URL context

Supported

Versions
Read the model version patterns for more details.
Latest update July 2025
Knowledge cutoff January 2025