Gemini 3 Flash

Frontier reasoning at Flash speed; 3x faster than 2.5 Pro with superior reasoning; 1M token context; 30% fewer tokens; $0.50/1M input token pricing.

Google Gemini

Gemini 3 Flash released December 16, 2025 as the new default model in the Gemini app, representing a major capability leap combining frontier Gemini 3 reasoning with Flash-tier speed and efficiency. The model delivers PhD-level reasoning comparable to much larger models while maintaining 3x faster inference than Gemini 2.5 Proachieving 0.21-0.37 second first-token latency and 163 tokens/second output speed. Remarkably, 3 Flash outperforms 2.5 Pro on coding benchmarks (78% vs 91% on SWE-bench Verified) while costing 75% less ($0.50 vs $2+ per million input tokens). The model uses 30% fewer tokens on average than 2.5 Pro on typical traffic while improving performance. Features 1-million-token context, native audio I/O, dynamic reasoning depth adaptation, and multimodal input support. Immediately available globally to Gemini app users at no cost. Knowledge cutoff: January 2025. Now the recommended default for production deployments.

Reviews

No Reviews Yet

Be the first to share your experience with this AI tool

Gemini 3 Flash

Reviews

No Reviews Yet

More models from Google Gemini

Gemini 2.5 Flash-Lite

Gemini 3 Pro

Gemini 1.0 Pro

Gemini 1.0 Ultra

Gemini 1.5 Pro

Gemini 1.5 Flash

Gemini 2.0 Flash-Lite

Gemini 2.0 Pro

Gemini 2.0 Flash

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 1.0 Nano