logo
#

Latest news with #ChatGPTPro

OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing
OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing

Business Mayor

time24-05-2025

  • Business
  • Business Mayor

OpenAI updates Operator to o3, making its $200 monthly ChatGPT Pro subscription more enticing

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More It was a big week for AI announcements following events from Microsoft, Google, and Anthropic. But OpenAI is finishing things out with news of its own. And no, we're not just talking about its $6.5 billion acquisition of Jony Ive's design team to lead a new hardware effort, 'io' at OpenAI. Today, the company upgraded its Operator autonomous web browsing and cursor controlling agent within ChatGPT from using the prior GPT-4o multimodal large language model to the newer and more powerful o3 reasoning model. The update, released globally today, May 23, 2025, is available as a 'research preview' to paying subscribers of OpenAI's $200 USD-monthly ChatGPT Pro plan. Basically, that is OpenAI's way of saying it's not a fully 'sanded down' or perfected product yet — it may still have kinks and issues. But with rival Google offering its own top tier AI subscription bundle for a price of nearly $250 USD regularly (currently running a discount down to $125 for the first three months) to access its latest Gemini multimodal, Imagen image generation, and Veo video generation models, suddenly OpenAI's ChatGPT Pro plan seems more affordable by comparison. Operator first debuted in January 2025 as OpenAI's initial step into semi-autonomous agents, specifically Computer Using Agents (CUAs). The idea is to go beyond the chatbot interface of ChatGPT and allow OpenAI's powerful AI models to start taking more actions on behalf of the user. Thus, Operator was designed to autonomously point, click, scroll, and type to complete web-based tasks such as booking dinner reservations, compiling shopping lists, or ordering event tickets. This agentic capability allows it to complete user tasks directly through a browser interface, from booking reservations to gathering online data. Read More Daedalic closes game development after Gollum flop For safety, privacy and security purposes, Operator didn't use any existing web browser on a user's PC or Mac. Instead, it ran in a cloud-hosted virtual browser accessible via a standalone site— users could input requests and observe the agent perform tasks in real time. It combined vision, reasoning, and interaction capabilities based on GPT-4o, marking a new direction for OpenAI in agentic AI. The product was launched as a research preview for ChatGPT Pro subscribers and featured built-in safety measures like user confirmations, Watch Mode, and restrictions on high-risk web platforms. It was also being tested in enterprise contexts, including travel planning and civic services, demonstrating its potential across both consumer and business environments. With this update, OpenAI aims to enhance performance across several key dimensions. The new o3-based Operator demonstrates improved persistence and accuracy during browser interactions. In practical terms, this means it is more likely to complete user tasks successfully and with less need for correction or repetition. Moreover, users can expect responses that are clearer, more structured, and more comprehensive. In comparative evaluations, the new model shows a distinct preference advantage over its predecessor. Human preference studies reveal that users favor the o3 model for its style, comprehensiveness, and clarity. It also performs strongly in instruction following and efficiency, though results for factual correctness are more balanced between versions. Performance on third-party evaluation benchmarks reflects these enhancements. On the OSWorld benchmark that measures completion of browser-based tasks, the o3 model scores 42.9 compared to 38.1 for the previous version. However, OpenAI notes that due to limitations in the automated grading system, the actual performance gain could be closer to 20 percentage points! On WebArena, the new model achieved a score of 62.9, up from 48.1. The most dramatic improvement appears on the GAIA benchmark, where the o3 model scores 62.2, vastly surpassing the prior model's 12.3. Side-by-side task comparisons further illustrate these gains. In one example involving a restaurant booking request, the new model provided a clearer and more detailed list of available reservations, including locations, Michelin ratings, and seating notes, presented in a well-formatted table. The previous version, while functional, delivered less information in a less organized manner, according to an image included with the new o3 Operator release notes: Safeguards remain, as do general cautionary notes about usage on sensitive, financial transactions and account access The o3 model also inherits the safety measures introduced with earlier versions, with further fine-tuning for its role as an agentic system. OpenAI has integrated enhanced training against harmful task execution, prompt injection vulnerabilities, and mistakes involving user intent. Evaluations show that the model now confirms 94% of sensitive actions before executing them, with 100% confirmation in financial transactions. Prompt injection susceptibility has also decreased from 23% to 20%. Notably, the o3 Operator maintains a cautious boundary on certain high-risk web interactions, such as email or financial platforms, where it may require user supervision via Watch Mode or explicitly refuse to proceed. These measures are part of a layered approach to safety that combines model-level robustness with real-time monitoring. While the upgrade to Operator marks a technical improvement, it also reflects OpenAI's ongoing commitment to responsible AI deployment. The system's ability to take real-world actions introduces new risks, and the development team continues to refine its safety protocols accordingly. Read More Approaching the issue of diversity in the tech industry According to OpenAI's updated o3 system card documentation, the model remains below high-risk capability thresholds in categories such as biological and chemical misuse and has no native coding environment or terminal access, further reducing potential misuse vectors. Operator remains a research preview and is accessible only to ChatGPT Pro users. The Responses API version of Operator will continue to be based on the GPT-4o model, at least for now. The upgraded Operator stands to significantly enhance the workflows of professionals in AI engineering, orchestration, data management, and IT security. For those building or maintaining machine learning models, the model's improved accuracy and structured outputs reduce the overhead of test validation and troubleshooting. In orchestration contexts, it offers a practical, reliable tool for automating browser-based components of complex pipelines. Data engineers can delegate manual web interactions—such as data verification and scraping—with more confidence, freeing time for higher-level optimization work. Security professionals, meanwhile, gain a safer way to simulate user behavior in audits and incident response exercises, thanks to the model's layered safety mechanisms. Across these disciplines, the o3-based Operator introduces both a capability upgrade and a risk mitigation framework, making it a practical addition to the modern technical toolkit.

OpenAI takes on Google, Anthropic with new AI agent for coders
OpenAI takes on Google, Anthropic with new AI agent for coders

The Star

time19-05-2025

  • Business
  • The Star

OpenAI takes on Google, Anthropic with new AI agent for coders

OpenAI is rolling out a new artificial intelligence agent for ChatGPT users that's designed to help streamline software development as the company pushes into a crowded market of startups and large tech firms offering AI tools for coders. The agent, called Codex, will be able to write software features, fix bugs and run tests, the company said in a blog post Friday. Codex, which is still in the early stages and has limited functionality, is geared towards workers with some technical knowledge and will first be released as a "research preview' to paid ChatGPT Pro, Enterprise and Team users. A growing number of tech companies, including Microsoft Corp.-owned Github, Alphabet Inc.'s Google and Anthropic, offer AI tools for programmers. Some startups, including Cursor maker Anysphere and Windsurf, have also attracted users and investors with AI-infused coding assistants that can analyze a software developer's actions and suggest the next few lines. In a sign of how important this emerging market is to the company, OpenAI is in talks to buy Windsurf for about $3 billion, Bloomberg News has reported. The deal would be the company's largest acquisition to date. AI agents are billed as tools that can field more complex requests on behalf of users with minimal supervision. OpenAI said its technical staff are already using the coding agent daily for a range of work, from repetitive tasks to helping build new features. Other companies, including Cisco Systems Inc. and Kodiak Robotics, have also been using the tool, OpenAI said. "We're just seeing very fast progress in the model's ability to solve coding and software engineering problems,' said Josh Tobin, research lead on agents at OpenAI. "We see this as a particularly fast way for us to get to that agents vision.' Codex runs on a version of OpenAI's o3 AI reasoning model that is optimized for software engineering. The tool can take anywhere from one to 30 minutes to complete a task, depending on complexity. OpenAI also said Codex was trained to identify and refuse requests aimed at the development of malicious software, a nod to concerns that bad actors could turn to more sophisticated coding agents for cyber attacks and other harmful uses. – Bloomberg

What is Codex, OpenAI's latest AI coding agent capable of multitasking?
What is Codex, OpenAI's latest AI coding agent capable of multitasking?

Indian Express

time18-05-2025

  • Business
  • Indian Express

What is Codex, OpenAI's latest AI coding agent capable of multitasking?

OpenAI on Friday, May 16, introduced a new AI tool called Codex that is designed to handle multiple software engineering-related tasks at the same time, from generating code for new features to answering questions about a user's codebase, fixing bugs, and suggesting pull requests for code review The cloud-based, AI agent-driven coding tool runs these tasks in its own cloud sandbox environment that has been preloaded with a user's code repository. Codex has been released under research preview. However, all ChatGPT Pro, Enterprise, and Team users have access to the AI coding tool. 'Users will have generous access at no additional cost for the coming weeks so you can explore what Codex can do, after which we'll roll out rate-limited access and flexible pricing options that let you purchase additional usage on-demand,' OpenAI said in a blog post. ChatGPT Plus and Edu customers will be given access at a later date, the Microsoft-backed AI startup added. today we are introducing codex. it is a software engineering agent that runs in the cloud and does tasks for you, like writing a new feature of fixing a bug. you can run many tasks in parallel. — Sam Altman (@sama) May 16, 2025 OpenAI's latest offering comes at a time when AI is poised to disrupt the software engineering sector, raising widespread fears of job displacement. Microsoft CEO Satya Nadella recently said that 30 per cent of the company's code is now AI-generated. A few weeks later, the tech giant announced it is laying off 6,000 employees or 3 per cent of its workforce, with programmers reportedly being impacted the most. 'It still remains essential for users to manually review and validate all agent-generated code before integration and execution,' OpenAI noted in its Codex announcement blog post. With Codex, developers can delegate simple programming tasks to an AI agent. It has its own unique interface that can be accessed from the side bar in the ChatGPT web app. Codex is powered by codex-1, an AI model that is a variation of OpenAI's o3 reasoning model. Except that codex-1 has been specifically trained on a wide range of real-world coding tasks to analyse and generate code 'that closely mirrors human style and PR preferences, adheres precisely to instructions.' Its outputs have further been fine-tuned using reinforcement learning so that codex-1 can 'iteratively run tests until it receives a passing result.' In terms of performance and accuracy, OpenAI said that codex-1 fared better than its o3 AI model when evaluated on its internal SWE benchmark as well as the company's human-validated version of it (SWE-bench Verified). Codex can read and edit files as well as run commands including test harnesses, linters, and type checkers. It typically takes anywhere between one minute to 30 minutes to complete a task depending on the difficulty level, as per OpenAI. The AI coding agent performs each task in a distinct, isolated environment that is preloaded with the user's codebase serving as context. 'Like human developers, Codex agents perform best when provided with configured dev environments, reliable testing setups, and clear documentation,' OpenAI said. Users can make Codex work more effectively for them by including files placed within their repository. 'These are text files, akin to where you can inform Codex how to navigate your codebase, which commands to run for testing, and how best to adhere to your project's standard practices,' OpenAI further said. Another unique feature of Codex is that it shows its thinking and work with every step as it goes about completing the task(s). In the past, several developers have pointed out that AI coding agents produce coding scripts that do not follow standards and are difficult to debug. 'Codex provides verifiable evidence of its actions through citations of terminal logs and test outputs, allowing you to trace each step taken during task completion,' OpenAI said. Once Codex completes a task, it commits its changes in its environment. However, users can also review the results, request further revisions, open a GitHub pull request, or directly make changes in the local development environment. In order for Codex to start generating code, users need to enter a prompt and click on 'code'. If they want the AI coding agents to answer questions or provide suggestions, then users need to select the 'ask' option before submitting the prompt. When OpenAI opened up early access to Codex for external partners, they used the AI coding agent tool to accelerate feature development, debug issues, write and execute tests, and refactor large codebases. Another early tester used Codes to speed up small but repetitive tasks like improving test coverage and fixing integration failures.' It can also be used to write debugging tools and help developers understand unfamiliar parts of the codebase by surfacing relevant context and past changes. OpenAI developers are also using Codex internally for refactoring, renaming, and writing tests as well as scaffolding new features, wiring components, fixing bugs, and drafting documentation. 'Based on learnings from early testers, we recommend assigning well-scoped tasks to multiple agents simultaneously, and experimenting with different types of tasks and prompts to explore the model's capabilities effectively,' the company said. In April this year, OpenAI launched another AI coding agent tool called Codex CLI. It is said to be an open-source, command-line tool capable of reading, modifying, and running code locally on a user's terminal. The coding agent integrates OpenAI's models with the client's command-line interface (CLI) used to run programmes, manage files, and more. Codex CLI is powered by OpenAI's latest o4-mini model by default. However, users can choose their preferred OpenAI model via the Responses API option. Codex CLI can only run on macOS and Linux systems for now, with support for Windows still in the experimental stage. In Friday's blog post, OpenAI also announced updates to Codex CLI. A smaller version of codex-1 is coming to Codex CLI. 'It's available now as the default model in Codex CLI and in the API as codex-mini-latest,' OpenAI said. The company has also simplified the developer log-in process for Codex CLI. Instead of having to manually generate and configure an API token, developers can now use their ChatGPT account to sign into Codex CLI and select the API organisation they want to use. 'Plus and Pro users who sign in to Codex CLI with ChatGPT can also begin redeeming $5 and $50 in free API credits, respectively, later today for the next 30 days,' OpenAI said.

With OpenAI's New Programming Agent Making Headlines, Here's Why MIND of Pepe Is DeFi's Best AI Agent
With OpenAI's New Programming Agent Making Headlines, Here's Why MIND of Pepe Is DeFi's Best AI Agent

Business Mayor

time17-05-2025

  • Business
  • Business Mayor

With OpenAI's New Programming Agent Making Headlines, Here's Why MIND of Pepe Is DeFi's Best AI Agent

Strict editorial policy that focuses on accuracy, relevance, and impartiality Created by industry experts and meticulously reviewed The highest standards in reporting and publishing Strict editorial policy that focuses on accuracy, relevance, and impartiality Morbi pretium leo et nisl aliquam mollis. Quisque arcu lorem, ultricies quis pellentesque nec, ullamcorper eu odio. The coding community is buzzing with excitement as OpenAI announces the launch of Codex. It's a cloud-based software engineering agent built to lend a helping hand. Codex will allow developers to automate their work by tackling repetitive (but important) tasks on their behalf. These include fixing bugs, drafting documentation, scaffolding new features, and renaming, refactoring, and writing tests. Keep reading to learn more about Codex, including OpenAI's intent behind it and why it's a great example of where we're headed with AI agents. We'll also touch upon the growing popularity of AI agents in crypto and DeFi and discuss why MIND of Pepe is the best AI agent in crypto today. More About Codex Starting now, ChatGPT Pro, Enterprise, and Team users can access Codex on their dashboard. Codex will usher in a new era in vibe coding, which, in case you didn't know, is the practice of using AI tools for software engineering tasks. Unlike traditional coding, which can result in opaque software difficult to debug, Codex has been built to explain exactly what it's doing, which will help developers fix any future issues. We're about to undergo a pretty seismic shift in terms of how developers can be most accelerated by agents. – Alexander Embiricos, a member of OpenAI's product team working on agents Although you can already write code on ChatGPT, the Codex AI agent runs within a sandboxed environment in the cloud, allowing it to run commands and explore folders and even test the code autonomously. OpenAI aims to develop Codex as a 'virtual teammate' instead of just an AI assistant. The company says that it's already using the agent to automate repetitive tasks internally. The launch of Codex is perhaps the perfect opportunity to talk about the AI agent market, which has been on fire in 2025. This DeFi segment surged past $5B in total valuation in 2024, and experts expect it to swell to a brain-melting $47B in the next five years. Just this week, AI-focused cryptocurrencies increased by $10B in market capitalization. If you want to ride the growth of AI agents in crypto, MIND of Pepe could just be what you're looking for. After all, it's the perfect example to showcase just how powerful AI-crypto partnerships can get. What Is MIND of Pepe ($MIND)? $MIND is an autonomous AI agent armored with state-of-the-art hive-mind intelligence, which empowers it to assess social sentiments and market trends to identify the next cryptos to explode. To put it more neatly: $MIND is an AI agent that lives on dApps and online platforms like X. There, it chats with the crypto community, acknowledging their insights and opinions on various altcoins. It then uses its AI capabilities to study these data points and find out which cryptos could rally as a result of brewing market hype. It's worth noting that this AI agent's real-time crypto recommendations will only be available to $MIND token holders. The $MIND Presale Is Ending Soon Are you, too, guilty of scouring shady websites and scammy Discord/Telegram groups looking for the next big crypto coin? We know the feeling! Well, both your disappointment as a crypto scout and the color red in your crypto portfolio are going to be a thing of the past in less than two weeks from now when MIND of Pepe finally goes live. Speaking of the $MIND presale, it has had a fantastic run. Hundreds of thousands of investors have pooled over $9.4M in early funding, making MIND of Pepe one of the best crypto presales this year. The presale is coming to an end, though. With less than 14 days to go, this is your last chance to buy $MIND for such a low price – just $0.0037515 per token. If this is your first time buying a new meme coin on presale, here's our detailed guide on how to buy MIND of Pepe. The Benefits of Buying $MIND Are Endless In addition to receiving its expert crypto investment advice, $MIND presale token holders will also get exclusive access to the tokens the AI agent creates firsthand. You heard that right! MIND of Pepe, because it's self-evolving, will ultimately have the smarts to create cryptos from scratch. Naturally, these new cryptos will be based on what's trending in the market, meaning they'll be in a pole position to rocket to the moon. What's more, MIND of Pepe also has an extremely rewarding staking mechanism in place. At the time of writing, those who choose to stake their $MIND tokens will get 241% APY. Unlock all these benefits by becoming an early investor in MIND of Pepe today. Seeing as MIND of Pepe will bring about a massive shift in how the average crypto investor picks his portfolio, it should hardly be a surprise that our $MIND price prediction suggests that the token could reach $0.030 by the end of 2030. So, if you become a $MIND investor now for just $0.0037515 per token, you could potentially make 800% in less than five years.

What Is Codex? AI Coding Agent By OpenAI That May Replace Software Engineers
What Is Codex? AI Coding Agent By OpenAI That May Replace Software Engineers

NDTV

time17-05-2025

  • Business
  • NDTV

What Is Codex? AI Coding Agent By OpenAI That May Replace Software Engineers

OpenAI on Friday (May 16) announced the launch of Codex, the company's most capable artificial intelligence (AI) coding agent yet. Available to ChatGPT Pro, Enterprise, and Team subscribers, the software engineering agent runs in the cloud and can act as a "virtual coworker" for engineers, helping them write code, fix bugs -- all at an exceptional speed. OpenAI CEO Sam Altman took to social media to announce the research preview of the product that is powered by the latest o3 reasoning model. "Today we are introducing Codex. It is a software engineering agent that runs in the cloud and does tasks for you, like writing a new feature of fixing a bug. You can run many tasks in parallel," wrote Mr Altman on X (formerly Twitter). today we are introducing codex. it is a software engineering agent that runs in the cloud and does tasks for you, like writing a new feature of fixing a bug. you can run many tasks in parallel. — Sam Altman (@sama) May 16, 2025 As per OpenAI, Codec can "read and edit files, as well as run commands including test harnesses, linters, and type checkers". Depending on the complexity of the task, Codex takes typically anywhere between one to 30 minutes to complete the code. Codex is built to allow users to start multiple sessions at once, so they can have multiple agents working in parallel. How to use Codex? In order to use Codex, users need to simply go to the sidebar on ChatGPT. Assign the AI agent a new coding task by entering a prompt and clicking on 'Code'. During task execution, internet access is disabled, limiting the agent's interaction solely to the code explicitly provided via GitHub repositories. After the completion of an assigned task, Codex provides users with verifiable evidence of its actions via citations of terminal logs. When uncertain or faced with test failures, the Codex agent explicitly communicates these issues, enabling users to make informed decisions. Future of software engineers AI tools for software engineers have surged in popularity in recent months. Most IT companies have been claiming that writing code may become an archaic profession with AI taking over the role. The CEOs of tech behemoths such as Google and Microsoft have already claimed that roughly 30 per cent of their companies' code was now written by AI. The release of Codex might further accelerate the pace of AI-generated coding. Quizzed about what software engineering will look like 10 years from now, the Codex team suggested that speed and reliability of coding may go up, hinting towards increased use of AI. "We should be able to transform a reasonable specification of software we want into a working version of that software in a good timeframe and reliably," wrote Jerry Tworek, VP of Research at OpenAI, during an AMA on Reddit, with a user replying: "Allow me to translate into simple English: Software engineers should be scared and running to up-skill, like yesterday."

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store