
OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing
It was a big week for AI announcements following events from Microsoft, Google, and Anthropic. But OpenAI is finishing things out with news of its own. And no, we're not just talking about its $6.5 billion acquisition of Jony Ive's design team to lead a new hardware effort, 'io' at OpenAI.
Today, the company upgraded Operator, its autonomous web-browsing and cursor-controlling agent within ChatGPT, from the prior GPT-4o multimodal large language model to the newer and more powerful o3 reasoning model.
The update, released globally today, May 23, 2025, is available as a 'research preview' to paying subscribers of OpenAI's $200-per-month ChatGPT Pro plan.
Basically, that is OpenAI's way of saying it's not a fully 'sanded down' or perfected product yet — it may still have kinks and issues.
But rival Google now offers its own top-tier AI subscription bundle, with access to its latest Gemini multimodal, Imagen image generation, and Veo video generation models, at a regular price of nearly $250 USD per month (currently discounted to $125 for the first three months). By comparison, OpenAI's ChatGPT Pro plan suddenly seems more affordable.
Operator first debuted in January 2025 as OpenAI's initial step into semi-autonomous agents, specifically Computer Using Agents (CUAs). The idea is to go beyond the chatbot interface of ChatGPT and allow OpenAI's powerful AI models to start taking more actions on behalf of the user.
Thus, Operator was designed to autonomously point, click, scroll, and type directly in a browser interface to complete web-based tasks such as booking dinner reservations, compiling shopping lists, ordering event tickets, or gathering online data.
For safety, privacy and security purposes, Operator didn't use any existing web browser on a user's PC or Mac. Instead, it ran in a cloud-hosted virtual browser accessible via a standalone site—operator.chatgpt.com—where users could input requests and observe the agent perform tasks in real time.
It combined vision, reasoning, and interaction capabilities based on GPT-4o, marking a new direction for OpenAI in agentic AI.
The product was launched as a research preview for ChatGPT Pro subscribers and featured built-in safety measures like user confirmations, Watch Mode, and restrictions on high-risk web platforms.
It was also being tested in enterprise contexts, including travel planning and civic services, demonstrating its potential across both consumer and business environments.
With this update, OpenAI aims to enhance performance across several key dimensions. The new o3-based Operator demonstrates improved persistence and accuracy during browser interactions.
In practical terms, this means it is more likely to complete user tasks successfully and with less need for correction or repetition. Moreover, users can expect responses that are clearer, more structured, and more comprehensive.
In comparative evaluations, the new model shows a distinct preference advantage over its predecessor. Human preference studies reveal that users favor the o3 model for its style, comprehensiveness, and clarity. It also performs strongly in instruction following and efficiency, though results for factual correctness are more balanced between versions.
Performance on third-party evaluation benchmarks reflects these enhancements. On the OSWorld benchmark, which measures completion of tasks in real computer environments, the o3 model scores 42.9 compared to 38.1 for the previous version.
However, OpenAI notes that due to limitations in the automated grading system, the actual performance gain could be closer to 20 percentage points.
On WebArena, the new model achieved a score of 62.9, up from 48.1. The most dramatic improvement appears on the GAIA benchmark, where the o3 model scores 62.2, vastly surpassing the prior model's 12.3.
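To put the three benchmark results side by side, the short Python sketch below computes the absolute gain (in points) and the relative gain (in percent) from the scores quoted above. The dictionary and the `gains` helper are illustrative names, not anything published by OpenAI, and the sketch assumes each score can be compared directly to its predecessor on the same scale.

```python
# Scores as quoted in this article:
# (previous GPT-4o-based Operator, new o3-based Operator).
SCORES = {
    "OSWorld": (38.1, 42.9),
    "WebArena": (48.1, 62.9),
    "GAIA": (12.3, 62.2),
}


def gains(old: float, new: float) -> tuple[float, float]:
    """Return (absolute gain in points, relative gain in percent), rounded to 0.1."""
    absolute = new - old
    relative = absolute / old * 100
    return round(absolute, 1), round(relative, 1)


if __name__ == "__main__":
    for name, (old, new) in SCORES.items():
        absolute, relative = gains(old, new)
        print(f"{name}: +{absolute} points ({relative}% relative to the old score)")
```

The relative figure makes the GAIA jump stand out: the new score is roughly five times the old one, while the OSWorld gain as scored by the automated grader is comparatively modest.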
Side-by-side task comparisons further illustrate these gains. In one example from the new o3 Operator release notes, involving a restaurant booking request, the new model provided a clearer and more detailed list of available reservations, including locations, Michelin ratings, and seating notes, presented in a well-formatted table. The previous version, while functional, delivered less information in a less organized manner.
Safeguards remain, as do general cautionary notes about use for sensitive financial transactions and account access
The o3 model also inherits the safety measures introduced with earlier versions, with further fine-tuning for its role as an agentic system.
OpenAI has integrated enhanced training against harmful task execution, prompt injection vulnerabilities, and mistakes involving user intent.
Evaluations show that the model now confirms 94% of sensitive actions before executing them, with 100% confirmation in financial transactions. Prompt injection susceptibility has also decreased from 23% to 20%.
Notably, the o3 Operator maintains a cautious boundary on certain high-risk web interactions, such as email or financial platforms, where it may require user supervision via Watch Mode or explicitly refuse to proceed. These measures are part of a layered approach to safety that combines model-level robustness with real-time monitoring.
While the upgrade to Operator marks a technical improvement, it also reflects OpenAI's ongoing commitment to responsible AI deployment.
The system's ability to take real-world actions introduces new risks, and the development team continues to refine its safety protocols accordingly.
According to OpenAI's updated o3 system card documentation, the model remains below high-risk capability thresholds in categories such as biological and chemical misuse and has no native coding environment or terminal access, further reducing potential misuse vectors.
Operator remains a research preview and is accessible only to ChatGPT Pro users. The Responses API version of Operator will continue to be based on the GPT-4o model, at least for now.
The upgraded Operator stands to significantly enhance the workflows of professionals in AI engineering, orchestration, data management, and IT security.
For those building or maintaining machine learning models, the model's improved accuracy and structured outputs reduce the overhead of test validation and troubleshooting.
In orchestration contexts, it offers a practical, reliable tool for automating browser-based components of complex pipelines.
Data engineers can delegate manual web interactions—such as data verification and scraping—with more confidence, freeing time for higher-level optimization work.
Security professionals, meanwhile, gain a safer way to simulate user behavior in audits and incident response exercises, thanks to the model's layered safety mechanisms.
Across these disciplines, the o3-based Operator introduces both a capability upgrade and a risk mitigation framework, making it a practical addition to the modern technical toolkit.