Latest news with #OpenAIo3


TECHx
5 days ago
- Business
- TECHx
Google AI Unveils Deep Think for Enhanced Reasoning
Google has officially announced the launch of Deep Think, the newest and most advanced reasoning model in its Gemini 2.5 series. The model offers a refined experience designed to improve logical thinking, problem-solving, and creativity. It is now available exclusively to Google AI Ultra subscribers through the Gemini app, which marks a significant milestone in Google's efforts to develop more capable and intuitive AI systems. According to Google, Deep Think outperformed leading competitor models such as OpenAI o3 and Grok 4 in multiple benchmark tests. The model was first revealed during the recent Google I/O conference, and the release follows a limited testing phase.
- Deep Think enhances AI reasoning and creativity
- Available now for Google AI Ultra subscribers
- Outperforms competitors in benchmark tests


Al Etihad
7 days ago
- Business
- Al Etihad
Google launches 'Deep Think' for AI Ultra subscribers
2 Aug 2025 15:11 WASHINGTON (AGENCIES) Google has officially launched Deep Think, the latest and most advanced reasoning model in its Gemini 2.5 series, offering a refined AI experience designed to enhance logical thinking, problem-solving, and creativity. The exclusive rollout is now available to Google AI Ultra subscribers through the Gemini app, marking a major milestone in the company's efforts to build more capable and intuitive AI systems. According to Google, "Deep Think" outperformed leading competitor models such as "OpenAI o3" and "Grok 4" in multiple benchmark tests. The release follows a limited testing phase, and the model was first unveiled by the company during the recent Google I/O conference.
Yahoo
10-06-2025
- Science
- Yahoo
New version of Gemini beats other AIs at math, science, and reasoning
Google's new Gemini Pro is smarter than other AIs at reasoning, science, and coding. This is according to a series of benchmark results posted by Google on Thursday. In short, Gemini 2.5 Pro beats chief competitors at nearly everything, though we're sure the companies behind those competitors would disagree. According to Google's data, Gemini 2.5 Pro has a healthy lead over OpenAI o3, Claude Opus 4, Grok 3 Beta, and DeepSeek R1 in the Humanity's Last Exam benchmark, which evaluates a model's math, science, knowledge, and reasoning. It's also better at code editing (per the Aider Polyglot benchmark), and it wins over all competitors in several factuality benchmarks including FACTS Grounding, meaning it's less likely to provide factually inaccurate text. The only benchmark in which Gemini 2.5 Pro isn't a clear winner is the mathematics-focused AIME 2025, and even there the differences between results are pretty small. As a result of all the improvements in Gemini 2.5 Pro, this model is now on top of the LMArena leaderboard with a score of 1470. There's a catch, though: The final version of Gemini 2.5 Pro isn't widely available yet. Google calls this latest version an "upgraded preview," with a stable version coming "in a couple of weeks." The preview should now be available in the Gemini app, though.


Economic Times
24-05-2025
- Economic Times
OpenAI upgrades Operator with o3 model for enhanced reasoning, safety
OpenAI is updating the artificial intelligence (AI) model powering Operator, its AI agent that can autonomously browse the web and interact with certain software inside a cloud-hosted virtual machine to carry out user requests. Operator will soon run on a model based on o3, one of the latest in OpenAI's o series of 'reasoning' models. Previously, Operator relied on a customised version of GPT-4o. By several benchmarks, o3 is a more advanced model, particularly on tasks requiring mathematical ability and reasoning. 'We are replacing the existing GPT-4o-based model for Operator with a version based on OpenAI o3,' OpenAI wrote in a blog post. 'The API version (of Operator) will remain based on 4o.' Operator is part of a growing set of agentic tools developed by AI firms as they compete to build agents capable of reliably performing digital tasks with minimal supervision. Google offers a similar agent through its Gemini API, which can browse the web and take actions on users' behalf. It also offers a consumer-facing version called Mariner. Anthropic's models can perform various computer tasks as well, including opening files and navigating webpages. According to OpenAI, the upgraded Operator model, dubbed o3 Operator, was 'fine-tuned with additional safety data for computer use,' using datasets designed to 'teach the model (OpenAI's) decision boundaries on confirmations and refusals.' The company has released a technical report detailing o3 Operator's performance in safety evaluations. Compared to the GPT-4o version, the new model is less likely to carry out illicit activities, search for sensitive personal data or fall prey to prompt injection, a common AI attack technique. 'o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator,' OpenAI wrote in its blog post. 'Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal.'


Times of India
24-05-2025
- Business
- Times of India
OpenAI upgrades Operator with o3 model for enhanced reasoning, safety
OpenAI is updating the artificial intelligence (AI) model powering Operator, its AI agent that can autonomously browse the web and interact with certain software inside a cloud-hosted virtual machine to carry out user requests. Operator will soon run on a model based on o3, one of the latest in OpenAI's o series of 'reasoning' models. Previously, Operator relied on a customised version of GPT-4o. By several benchmarks, o3 is a more advanced model, particularly on tasks requiring mathematical ability and reasoning. 'We are replacing the existing GPT-4o-based model for Operator with a version based on OpenAI o3,' OpenAI wrote in a blog post. 'The API version (of Operator) will remain based on 4o.' Operator is part of a growing set of agentic tools developed by AI firms as they compete to build agents capable of reliably performing digital tasks with minimal supervision. Google offers a similar agent through its Gemini API, which can browse the web and take actions on users' behalf. It also offers a consumer-facing version called Mariner. Anthropic's models can perform various computer tasks as well, including opening files and navigating webpages. According to OpenAI, the upgraded Operator model, dubbed o3 Operator, was 'fine-tuned with additional safety data for computer use,' using datasets designed to 'teach the model (OpenAI's) decision boundaries on confirmations and refusals.' The company has released a technical report detailing o3 Operator's performance in safety evaluations. Compared to the GPT-4o version, the new model is less likely to carry out illicit activities, search for sensitive personal data or fall prey to prompt injection, a common AI attack technique. 'o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator,' OpenAI wrote in its blog post. 'Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal.'