logo
#

Latest news with #OpenAIo3

OpenAI upgrades Operator with o3 model for enhanced reasoning, safety
OpenAI upgrades Operator with o3 model for enhanced reasoning, safety

Economic Times

time24-05-2025

  • Economic Times

OpenAI upgrades Operator with o3 model for enhanced reasoning, safety

OpenAI is updating the artificial intelligence (AI) model powering Operator, its AI agent that can autonomously browse the web and interact with certain software inside a cloud-hosted virtual machine to carry out user requests. Operator will soon run on a model based on o3, one of the latest in OpenAI's o series of 'reasoning' models. Previously, Operator relied on a customised version of GPT-4o. By several benchmarks, o3 is a more advanced model, particularly on tasks requiring mathematical ability and reasoning. 'We are replacing the existing GPT‑4o-based model for Operator with a version based on OpenAI o3,' OpenAI wrote in a blog post. 'The API version (of Operator) will remain based on 4o.' Operator is part of a growing set of agentic tools developed by AI firms as they compete to build agents capable of reliably performing digital tasks with minimal supervision. Google offers a similar agent through its Gemini API, which can browse the web and take actions on users' behalf. It also offers a consumer-facing version called Mariner. Anthropic's models can perform various computer tasks as well, including opening files and navigating webpages. According to OpenAI, the upgraded Operator model, dubbed o3 Operator, was 'fine-tuned with additional safety data for computer use,' using datasets designed to 'teach the model (OpenAI's) decision boundaries on confirmations and refusals.'The company has released a technical report detailing o3 Operator's performance in safety evaluations. Compared to the GPT-4o version, the new model is less likely to carry out illicit activities, search for sensitive personal data or fall prey to prompt injection, a common AI attack technique.'o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator,' OpenAI wrote in its blog post. 'Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal.'

OpenAI upgrades Operator with o3 model for enhanced reasoning, safety
OpenAI upgrades Operator with o3 model for enhanced reasoning, safety

Time of India

time24-05-2025

  • Business
  • Time of India

OpenAI upgrades Operator with o3 model for enhanced reasoning, safety

OpenAI is updating the artificial intelligence (AI) model powering Operator, its AI agent that can autonomously browse the web and interact with certain software inside a cloud-hosted virtual machine to carry out user requests. Operator will soon run on a model based on o3, one of the latest in OpenAI's o series of 'reasoning' models. Previously, Operator relied on a customised version of GPT-4o . By several benchmarks, o3 is a more advanced model, particularly on tasks requiring mathematical ability and reasoning. 'We are replacing the existing GPT‑4o-based model for Operator with a version based on OpenAI o3,' OpenAI wrote in a blog post . 'The API version (of Operator) will remain based on 4o.' Operator is part of a growing set of agentic tools developed by AI firms as they compete to build agents capable of reliably performing digital tasks with minimal supervision. Google offers a similar agent through its Gemini API, which can browse the web and take actions on users' behalf. It also offers a consumer-facing version called Mariner. Anthropic's models can perform various computer tasks as well, including opening files and navigating webpages. Discover the stories of your interest Blockchain 5 Stories Cyber-safety 7 Stories Fintech 9 Stories E-comm 9 Stories ML 8 Stories Edtech 6 Stories According to OpenAI, the upgraded Operator model, dubbed o3 Operator, was 'fine-tuned with additional safety data for computer use,' using datasets designed to 'teach the model (OpenAI's) decision boundaries on confirmations and refusals.' The company has released a technical report detailing o3 Operator's performance in safety evaluations. Compared to the GPT-4o version, the new model is less likely to carry out illicit activities, search for sensitive personal data or fall prey to prompt injection, a common AI attack technique. 'o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator,' OpenAI wrote in its blog post. 'Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal.'

OpenAI upgrades the AI model powering its Operator agent
OpenAI upgrades the AI model powering its Operator agent

Yahoo

time23-05-2025

  • Yahoo

OpenAI upgrades the AI model powering its Operator agent

OpenAI is updating the AI model powering Operator, its AI agent that can autonomously browse the web and use certain software within a cloud-hosted virtual machine to fulfill users' requests. Soon, Operator will use a model based on o3, one of the latest in OpenAI's o series of "reasoning" models. Previously, Operator relied on a custom version of GPT-4o. By many benchmarks, o3 is a far more advanced model, particularly on tasks involving math and reasoning. "We are replacing the existing GPT‑4o-based model for Operator with a version based on OpenAI o3," OpenAI wrote in a blog post. "The API version [of Operator] will remain based on 4o." Operator is one among many agentic tools released by AI companies in recent months. Companies are racing to make highly sophisticated agents that can reliably carry out chores more or less without supervision. Google offers a "computer use" agent through its Gemini API that can similarly browse the web and take actions on behalf of users, as well as a more consumer-focused offering called Mariner. Anthropic's models are also able to perform computer tasks, including opening files and navigating web pages. According to OpenAI, the new Operator model, called o3 Operator, was "fine-tuned with additional safety data for computer use," including datasets designed to "teach the model [OpenAI's] decision boundaries on confirmations and refusals." OpenAI has released a technical report showing o3 Operator's performance on specific safety evaluations. Compared to the GPT-4o Operator model, o3 Operator is less likely to refuse to perform "illicit" activities and search for sensitive personal data, and less susceptible to a form of AI attack known as prompt injection, per the technical report. "o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator," OpenAI wrote in its blog post. "Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal." This article originally appeared on TechCrunch at Sign in to access your portfolio

OpenAI upgrades the AI model powering its Operator agent
OpenAI upgrades the AI model powering its Operator agent

Yahoo

time23-05-2025

  • Yahoo

OpenAI upgrades the AI model powering its Operator agent

OpenAI is updating the AI model powering Operator, its AI agent that can autonomously browse the web and use certain software within a cloud-hosted virtual machine to fulfill users' requests. Soon, Operator will use a model based on o3, one of the latest in OpenAI's o series of "reasoning" models. Previously, Operator relied on a custom version of GPT-4o. By many benchmarks, o3 is a far more advanced model, particularly on tasks involving math and reasoning. "We are replacing the existing GPT‑4o-based model for Operator with a version based on OpenAI o3," OpenAI wrote in a blog post. "The API version [of Operator] will remain based on 4o." Operator is one among many agentic tools released by AI companies in recent months. Companies are racing to make highly sophisticated agents that can reliably carry out chores more or less without supervision. Google offers a "computer use" agent through its Gemini API that can similarly browse the web and take actions on behalf of users, as well as a more consumer-focused offering called Mariner. Anthropic's models are also able to perform computer tasks, including opening files and navigating webpages. According to OpenAI, the new Operator model, called o3 Operator, was "fine-tuned with additional safety data for computer use," including data sets designed to "teach the model [OpenAI's] decision boundaries on confirmations and refusals." OpenAI has released a technical report showing o3 Operator's performance on specific safety evaluations. Compared to the GPT-4o Operator model, o3 Operator is less likely to refuse to perform "illicit" activities and search for sensitive personal data, and less susceptible to a form of AI attack known as prompt injection, per the technical report. "o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator," OpenAI wrote in its blog post. "Although o3 Operator inherits o3's coding capabilities, it does not have native access to a coding environment or terminal." Error while retrieving data Sign in to access your portfolio Error while retrieving data

OpenAI takes on Google Gemini Anthropic with AI coding agent for ChatGPT
OpenAI takes on Google Gemini Anthropic with AI coding agent for ChatGPT

Economic Times

time16-05-2025

  • Business
  • Economic Times

OpenAI takes on Google Gemini Anthropic with AI coding agent for ChatGPT

Reuters FILE PHOTO: OpenAI logo is seen in this illustration taken March 31, 2023. REUTERS/Dado Ruvic/Illustration/File Photo OpenAI launched a research preview of Codex, a cloud-based software engineering agent on Friday. The AI coding agent is powered by codex-1, a version of OpenAI o3 optimized for software engineering, the AI platform can write features, answer questions about codebases, fix bugs, and propose pull requests for review. Each task will run in its own cloud sandbox, preloaded with the user's repository. OpenAI said Codex will be available on ChatGPT Pro, Enterprise, and Team users today, with support for Plus and Edu coming soon. It can be accessed through the ChatGPT sidebar, and assigned new tasks by typing a prompt and clicking 'Code'. Users can ask questions about a codebase by clicking 'Ask'. Codex's actions can be seen through citations of terminal logs and test outputs, helping trace each step taken. Users can then review the results, request further revisions, open a GitHub pull request, or directly integrate the changes on their workspaces. OpenAI said Codex was trained to identify and refuse requests aimed at the development of malicious software, addressing concerns that malicious actors could misuse this sophisticated coding agent for cyber attacks and other harmful uses. Apart from OpenAI, Microsoft-owned GitHub, Google and Anthropic, along with startups including Anysphere and Windsurf, offer AI tools for to aid programmers. Earlier this month, Google DeepMind added vastly improved coding capabilities to Gemini 2.5 Pro (Preview). In the run-up to its recently concluded Google I/O 2025 event, the search major released the AI agenct, now branded the I/O Edition. Internally labelled gemini-2.5-pro-preview-05-06, the model can now deliver significant improvements in code transformation, code editing, and even in developing complex agentic workflows — making it far more capable for software developers and engineers, according to Google.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store