Gemini 2.5 Flash-Lite

Fastest and most cost-efficient model in 2.5 family; optimized for classification and summarization at scale; lowest latency with strong performance.

Google Gemini

Gemini 2.5 Flash-Lite released June 17, 2025 as the cost-optimized tier of the 2.5 family, designed for high-throughput applications prioritizing speed and economy. The model achieves the lowest latency in the Gemini 2.5 lineup and highest tokens-per-second decode rate, making it ideal for processing classification tasks, content summarization, and other operations at scale across millions of daily requests. Flash-Lite delivers performance improvements over previous generation 2.0 Flash-Lite across reasoning, coding, multimodality, and long-context understanding. The model supports 1-million-token context and thinking mode capabilities. Pricing and efficiency make it the recommended choice for cost-sensitive deployments while still offering respectable quality. Knowledge cutoff: January 2025. Generally available in production July 2025.

Reviews

No Reviews Yet

Be the first to share your experience with this AI tool

Gemini 2.5 Flash-Lite

Reviews

No Reviews Yet

More models from Google Gemini

Gemini 3 Pro

Gemini 1.0 Pro

Gemini 1.0 Ultra

Gemini 1.5 Pro

Gemini 1.5 Flash

Gemini 2.0 Flash-Lite

Gemini 2.0 Pro

Gemini 2.0 Flash

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 3 Flash

Gemini 1.0 Nano