Our most cost-effective AI model yet

Today, we’re introducing Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model. Built for high-volume developer workloads at scale, 3.1 Flash-Lite delivers high quality for its price and model tier.

Starting today, 3.1 Flash-Lite is rolling out in preview to developers via the Gemini API in Google AI Studio and to enterprises via Vertex AI.

Cost-efficiency without compromise

Priced at just $0.25/1M input tokens and $1.50/1M output tokens, 3.1 Flash-Lite delivers enhanced performance at a fraction of the cost of larger models. According to the Artificial Analysis benchmark, it outperforms 2.5 Flash with a 2.5X faster Time to First Answer Token and a 45% increase in output speed, while maintaining similar or better quality. This low latency is essential for high-frequency workflows, making it an ideal model for developers building responsive, real-time experiences.
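
To make the pricing concrete, here is a minimal sketch of the arithmetic in Python. The two rates come from the prices quoted above; the function name `estimate_cost` and the example workload (10,000 requests, 2,000 input tokens and 500 output tokens each) are hypothetical and chosen only for illustration. Actual bills may differ depending on pricing details not covered here.

```python
# Rates quoted in this post, in USD per 1 million tokens.
INPUT_PRICE_PER_MILLION = 0.25
OUTPUT_PRICE_PER_MILLION = 1.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper: it applies the two per-million-token rates
    and nothing else (no discounts, tiers, or other fees).
    """
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


# Illustrative workload (assumed numbers, not from the post):
# 10,000 requests, each with 2,000 input tokens and 500 output tokens.
requests = 10_000
cost = estimate_cost(requests * 2_000, requests * 500)
print(f"Estimated cost: ${cost:.2f}")  # prints "Estimated cost: $12.50"
```

In this example, the 20M input tokens cost $5.00 and the 5M output tokens cost $7.50, so output accounts for most of the bill even though it is a quarter of the token volume.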