Latest news with #Gemini2.5Flash-Lite

Google launches Gemini 2.5 models with pricing & speed updates

Techday NZ

25-06-2025

Business
Techday NZ

Google launches Gemini 2.5 models with pricing & speed updates

Google has released updates to its Gemini 2.5 suite of artificial intelligence models, detailing stable releases, new offerings, and pricing changes. Model releases The company announced that Gemini 2.5 Pro and Gemini 2.5 Flash are now generally available and deemed stable, maintaining the same versions that had previously been available for preview. In addition, Google introduced Gemini 2.5 Flash-Lite in preview, providing an option focused on cost-effectiveness and latency within the Gemini 2.5 product line. Gemini 2.5 models are described as "thinking models" capable of reasoning through their processes before generating responses, a feature that is expected to enhance the performance and accuracy of the tools. The models allow developers to manage a so-called "thinking budget", granting greater control over the depth and speed of reasoning based on the needs of individual applications. Gemini 2.5 Flash-Lite Gemini 2.5 Flash-Lite is intended as an upgrade for customers currently using previous iterations such as Gemini 1.5 and 2.0 Flash models. According to the company, the new model improves performance across several evaluation measures, reduces the time to first token, and increases decoding speed in terms of tokens per second. Flash-Lite is targeted at high-volume use cases like classification and summarisation at scale, where throughput and cost are key considerations. This model provides API-level control for dynamic management of the "thinking budget." It is set apart from other Gemini 2.5 models in that its "thinking" function is deactivated by default, reflecting its focus on cost and speed. Gemini 2.5 Flash-Lite includes existing features such as grounding with Google Search, code execution, URL context, and support for function calling. Updates and pricing Google also clarified changes to the Gemini 2.5 Flash model and its associated pricing structure. The pricing for 2.5 Flash has been updated to USD $0.30 per 1 million input tokens (increased from USD $0.15) and USD $2.50 per 1 million output tokens (reduced from USD $3.50). The company removed the distinction between "thinking" and "non-thinking" pricing and established a single price tier, irrespective of input token size. In a joint statement, Shrestha Basu Mallick, Group Product Manager, and Logan Kilpatrick, Group Product Manager, said: "While we strive to maintain consistent pricing between preview and stable releases to minimize disruption, this is a specific adjustment reflecting Flash's exceptional value, still offering the best cost-per-intelligence available. And with Gemini 2.5 Flash-Lite, we now have an even lower cost option (with or without thinking) for cost and latency sensitive use cases that require less model intelligence." Customers using Gemini 2.5 Flash Preview from April will retain their existing pricing until the model's planned deprecation on July 15, 2025, after which they will be required to transition to the updated stable version or move to Flash-Lite Preview. Continued growth for Gemini 2.5 Pro Google reported that demand for Gemini 2.5 Pro is "the steepest of any of our models we have ever seen." The stable release of the 06-05 version is intended to increase capacity for customers using Gemini 2.5 Pro in production environments, maintaining the existing price point. The company indicated that the model is particularly well-suited for tasks requiring significant intelligence and advanced capabilities, such as coding and agentic tasks, and noted its adoption in a range of developer tools. "We expect that cases where you need the highest intelligence and most capabilities are where you will see Pro shine, like coding and agentic tasks. Gemini 2.5 Pro is at the heart of many of the most loved developer tools." Google highlighted a range of tools built on Gemini 2.5 Pro, including offerings from Cursor, Bolt, Cline, Cognition, Windsurf, GitHub, Lovable, Replit, and Zed Industries. The company advised that users of the 2.5 Pro Preview 05-06 model will be able to access it until June 19, 2025, when it will be discontinued. Those using the 06-05 preview version are directed to update to the now-stable "gemini-2.5-pro" model. The statement concluded: "We can't wait to see even more domains benefit from the intelligence of 2.5 Pro and look forward to sharing more about scaling beyond Pro in the near future."

Google rolls out budget-friendly Gemini 2.5 Flash Lite, opens 2.5 Flash and Pro to all

India Today

18-06-2025

Business
India Today

Google rolls out budget-friendly Gemini 2.5 Flash Lite, opens 2.5 Flash and Pro to all

Google has introduced a new addition to its Gemini AI model line-up — the Gemini 2.5 Flash-Lite. According to Google, this new AI model can deliver high performance at the lowest cost and fastest speeds yet. Alongside the new model, the company has announced the general availability of the Gemini 2.5 Flash and Pro models to all says that Gemini 2.5 Flash-Lite is its most affordable and fastest model in the 2.5 family. It has been built to handle large volumes of latency-sensitive tasks such as translation, classification, and reasoning at a lower computational cost. Compared to its predecessor, 2.0 Flash-Lite, the new model is said to deliver improved accuracy and quality across coding, maths, science, reasoning, and multimodal benchmarks. 'It excels at high-volume, latency-sensitive tasks like translation and classification, with lower latency than 2.0 Flash-Lite and 2.0 Flash on a broad sample of prompts,' says Google. advertisementGoogle highlights that despite being lightweight, 2.5 Flash-Lite comes with a full suite of advanced capabilities. These include support for multimodal inputs, a 1 million-token context window, integration with tools like Google Search and code execution, and the flexibility to modulate computational thinking based on budget. According to the company, these features make the Gemini 2.5 Flash-Lite ideal for developers looking to balance efficiency with robust AI 2.5 Flash-Lite availability The Gemini 2.5 Flash-Lite model is currently available in preview via Google AI Studio and Vertex AI. Google has also integrated customised versions of 2.5 Flash-Lite and Flash into its core products like Search, expanding their reach beyond developers to everyday 2.5 Flash and Pro models now available to allIn addition to introducing Flash-Lite, Google has also announced that its Gemini 2.5 Flash and Gemini 2.5 Pro models are now stable and generally available. These models were previously accessible to a select group of developers and organisations for early production to Google, companies like Snap, SmartBear, and creative tools provider Spline have already integrated these models into their workflows with encouraging results. Now that Flash and Pro are fully open, developers can use them in production-grade applications with greater the stable and preview models can be accessed through Google AI Studio, Vertex AI, and the Gemini app.

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Time of India

17-06-2025

Business
Time of India

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Google has expanded its family of Gemini 2.5 of hybrid reasoning AI models . The company said that its Gemini 2.5 Pro and Gemini 2.5 Flash models are now generally available. Further, it released a preview of the new 2.5 Flash-Lite model which it claims is its most cost-efficient and fastest model yet. "We designed Gemini 2.5 to be a family of hybrid reasoning models that provide amazing performance, while also being at the Pareto Frontier of cost and speed," Google stated in its announcement. General availability of Gemini 2.5 Pro and Gemini 2.5 Flash models The generally available versions of Gemini 2.5 Flash and 2.5 Pro are now ready for production applications, a move Google attributes to valuable developer feedback gathered over recent weeks. Adding to the lineup, Google has introduced a preview of Gemini 2.5 Flash-Lite, touted as its most cost-efficient and fastest 2.5 model to date. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Is it better to shower in the morning or at night? Here's what a microbiologist says CNA Read More Undo "Gemini 2.5 Pro + 2.5 Flash are now stable and generally available. Plus, get a preview of Gemini 2.5 Flash-Lite, our fastest + most cost-efficient 2.5 model yet," Google CEO Sundar Pichai said in a post on X. "Exciting steps as we expand our 2.5 series of hybrid reasoning models that deliver amazing performance at the Pareto frontier of cost and speed," he added. Google says that this new version is designed to excel in high-volume, latency-sensitive tasks like translation and classification, offering lower latency than its predecessors, 2.0 Flash-Lite and 2.0 Flash, across a wide range of prompts. Despite its enhanced efficiency, 2.5 Flash-Lite retains the core capabilities that define the Gemini 2.5 family. These include the ability to adjust computational "thinking" based on budget, integrate with tools such as Google Search and code execution, support multimodal input (processing various data types), and offer a substantial 1-million-token context length, the company says. According to Google, the model also demonstrates "all-around higher quality" than 2.0 Flash-Lite across benchmarks in coding, math, science, reasoning, and multimodal tasks. Developers can access the preview of Gemini 2.5 Flash-Lite through Google AI Studio and Vertex AI, alongside the newly stable versions of 2.5 Flash and Pro. Both 2.5 Flash and Pro are also now accessible directly within the Gemini app. Furthermore, custom versions of 2.5 Flash-Lite and Flash have been integrated into Google Search.

Latest news with #Gemini2.5Flash-Lite

Google launches Gemini 2.5 models with pricing & speed updates

Google rolls out budget-friendly Gemini 2.5 Flash Lite, opens 2.5 Flash and Pro to all

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Get Started Now: Download the App