Gemini 3.1 Flash-Lite: Built for intelligence at scale

5 hours ago

Gemini 3.1 Flash-Lite is introduced as the fastest and most cost-efficient model in the Gemini 3 series.
Priced at $0.25/1M input tokens and $1.50/1M output tokens, it offers enhanced performance at a lower cost.
It outperforms Gemini 2.5 Flash, with a 2.5x faster time to first answer token and a 45% increase in output speed.
It achieves an Elo score of 1432 on the Arena.ai leaderboard and performs strongly on reasoning and multimodal benchmarks.
It offers thinking levels in AI Studio and Vertex AI, letting developers control how much the model "thinks" for a given task.
It can handle high-volume tasks like translation and content moderation, as well as more complex workloads like UI generation.
Early testers praise its efficiency, reasoning capabilities, and ability to follow instructions precisely.
Companies such as Latitude, Cartwheel, and Whering are already using it to solve problems at scale.
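
To make the quoted pricing concrete, here is a minimal sketch of a cost estimate at $0.25 per 1M input tokens and $1.50 per 1M output tokens. The function name and the workload figures (10,000 requests of 800 input and 200 output tokens each) are hypothetical, chosen only for illustration; the sketch also assumes flat per-token pricing with no other charges.

```python
from decimal import Decimal

# Prices quoted in the summary, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.25")
OUTPUT_PRICE_PER_M = Decimal("1.50")


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper: assumes flat per-token pricing only.
    """
    million = Decimal(1_000_000)
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / million


# Hypothetical workload: 10,000 requests, 800 input and 200 output tokens each.
requests = 10_000
cost = estimate_cost(requests * 800, requests * 200)
print(f"${cost:.2f}")  # prints "$5.00"
```

Under these assumptions, 8M input tokens cost $2.00 and 2M output tokens cost $3.00, so output tokens dominate the bill even though they are a fifth of the volume.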
