As we speak we’re excited to share updates throughout the board to our Gemini 2.5 mannequin household:
- Gemini 2.5 Professional is mostly out there and secure (no adjustments from the 06-05 preview)
- Gemini 2.5 Flash is mostly out there and secure (no adjustments from the 05-20 preview, see pricing updates under)
- Gemini 2.5 Flash-Lite is now out there in preview
Gemini 2.5 fashions are pondering fashions, able to reasoning by means of their ideas earlier than responding, leading to enhanced efficiency and improved accuracy. Every mannequin has management over the pondering price range, giving builders the power to decide on when and the way a lot the mannequin “thinks” earlier than producing a response.
Overview of our household of Gemini 2.5 pondering fashions
Introducing Gemini 2.5 Flash-Lite
As we speak, we’re introducing 2.5 Flash-Lite in preview with the bottom latency and value within the 2.5 mannequin household. It’s designed as a cheap improve from our earlier 1.5 and a couple of.0 Flash fashions. It additionally presents higher efficiency throughout most evals, and decrease time to first token whereas additionally reaching greater tokens per second decode. This mannequin is nice for top throughput duties like classification or summarization at scale.
Gemini 2.5 Flash-Lite is a reasoning mannequin, which permits for dynamic management of the pondering price range with an API parameter. As a result of Flash-Lite is optimized for price and pace, “pondering” is off by default, not like our different fashions. 2.5 Flash-Lite additionally helps all of our native instruments like Grounding with Google Search, Code Execution, and URL Context along with operate calling.
Benchmarks for Gemini 2.5 Flash-Lite
Updates to Gemini 2.5 Flash and pricing
During the last yr, our analysis groups have continued to push the pareto frontier with our Flash mannequin collection. When 2.5 Flash was initially introduced, we had not but finalized the capabilities for two.5 Flash-Lite. We additionally launched with a “pondering” and “non-thinking worth”, which led to developer confusion.
With the secure model of Gemini 2.5 Flash rolling out (which is identical 05-20 mannequin preview we made out there at Google I/O), and the unbelievable efficiency of two.5 Flash, we’re updating the pricing for two.5 Flash:
- $0.30 / 1M enter tokens (*up from $0.15 enter)
- $2.50 / 1M output tokens (*down from $3.50 output)
- We eliminated the pondering vs. non-thinking worth distinction
- We saved a single worth tier no matter enter token measurement
Whereas we try to take care of constant pricing between preview and secure releases to attenuate disruption, this can be a particular adjustment reflecting Flash’s distinctive worth, nonetheless providing the perfect cost-per-intelligence out there.
And with Gemini 2.5 Flash-Lite, we now have a good decrease price possibility (with or with out pondering) for price and latency delicate use instances that require much less mannequin intelligence.
Pricing updates for our Gemini Flash household
In case you are utilizing the Gemini 2.5 Flash Preview 04-17 , the prevailing preview pricing will stay in impact till its deliberate deprecation on July 15, 2025, at which level that mannequin endpoint shall be turned off. You may transition to the commonly out there mannequin “gemini-2.5-flash”, or change to 2.5 Flash-Lite Preview as a decrease price possibility.
Continued progress of Gemini 2.5 Professional
The expansion and demand for Gemini 2.5 Professional continues to be the steepest of any of our fashions we have now ever seen. To permit extra clients to construct on this mannequin in manufacturing, we’re making the 06-05 model of the mannequin secure, with the identical pareto frontier worth level as earlier than.
We count on that instances the place you want the very best intelligence and most capabilities are the place you will notice Professional shine, like coding and agentic duties. Gemini 2.5 Professional is on the coronary heart of lots of the most cherished developer instruments.
High developer instruments utilizing Gemini 2.5 Professional
In case you are utilizing 2.5 Professional Preview 05-06, the mannequin will stay out there till June 19, 2025 after which shall be turned off. In case you are utilizing 2.5 Professional Preview 06-05, you may merely replace your mannequin string to “gemini-2.5-pro”.
We will’t wait to see much more domains profit from the intelligence of two.5 Professional and look ahead to sharing extra about scaling past Professional within the close to future.