Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 2.5 Flash-Lite is best for high-volume classification, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.
gemini-2.5-flash-lite
| Property | Description |
|---|---|
| Model code | gemini-2.5-flash-lite |
| Supported data types | Inputs: Text, image, video, audio, PDF. Output: Text |
| Token limits[*] | Input token limit: 1,048,576. Output token limit: 65,536 |
| Capabilities | Audio generation: Not supported; Batch API: Supported; Caching: Supported; Code execution: Supported; File search: Supported; Function calling: Supported; Grounding with Google Maps: Supported; Image generation: Not supported; Live API: Not supported; Search grounding: Supported; Structured outputs: Supported; Thinking: Supported; URL context: Supported |
| Versions | |
| Latest update | July 2025 |
| Knowledge cutoff | January 2025 |
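
As an illustration of the high-volume classification use case, the sketch below builds a request body for this model without sending it. The model code and token limits come from the table above. Everything else is an assumption made for illustration: the helper name `build_classification_request`, the prompt wording, the default output budget, and the endpoint URL pattern, which is assumed to follow the public Gemini API REST `generateContent` convention.

```python
import json

# Values taken from the table above.
MODEL_CODE = "gemini-2.5-flash-lite"
INPUT_TOKEN_LIMIT = 1_048_576
OUTPUT_TOKEN_LIMIT = 65_536

# Assumption: the standard Gemini API REST URL pattern for generateContent.
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    f"{MODEL_CODE}:generateContent"
)


def build_classification_request(text, labels, max_output_tokens=16):
    """Build a JSON-serializable request body asking for exactly one label.

    Hypothetical helper: it only constructs the payload and does not
    call the API.
    """
    if not labels:
        raise ValueError("labels must not be empty")
    if not 1 <= max_output_tokens <= OUTPUT_TOKEN_LIMIT:
        raise ValueError(
            f"max_output_tokens must be between 1 and {OUTPUT_TOKEN_LIMIT}"
        )
    prompt = (
        "Classify the text into exactly one of these labels: "
        + ", ".join(labels)
        + ".\nReply with the label only.\n\nText: "
        + text
    )
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            # A short label needs only a small output budget.
            "maxOutputTokens": max_output_tokens,
            # Low temperature keeps labels as repeatable as possible.
            "temperature": 0,
        },
    }


if __name__ == "__main__":
    body = build_classification_request(
        "The package arrived two weeks late.",
        ["complaint", "praise", "question"],
    )
    print(ENDPOINT)
    print(json.dumps(body, indent=2))
```

To send the request, the body would be POSTed as JSON to `ENDPOINT` with an API key, for example in an `x-goog-api-key` header; that step is omitted here so the sketch runs without credentials or network access.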