OpenAI takes on Google, Anthropic with new AI agent for coders

The Star

19-05-2025

OpenAI is rolling out a new artificial intelligence agent for ChatGPT users that's designed to help streamline software development as the company pushes into a crowded market of startups and large tech firms offering AI tools for coders.
The agent, called Codex, will be able to write software features, fix bugs and run tests, the company said in a blog post Friday. Codex, which is still in the early stages and has limited functionality, is geared towards workers with some technical knowledge and will first be released as a "research preview" to paid ChatGPT Pro, Enterprise and Team users.
A growing number of tech companies, including Microsoft Corp.-owned GitHub, Alphabet Inc.'s Google and Anthropic, offer AI tools for programmers. Some startups, including Cursor maker Anysphere and Windsurf, have also attracted users and investors with AI-infused coding assistants that can analyze a software developer's actions and suggest the next few lines.
In a sign of how important this emerging market is to the company, OpenAI is in talks to buy Windsurf for about $3 billion, Bloomberg News has reported. The deal would be the company's largest acquisition to date.
AI agents are billed as tools that can field more complex requests on behalf of users with minimal supervision. OpenAI said its technical staff are already using the coding agent daily for a range of work, from repetitive tasks to helping build new features. Other companies, including Cisco Systems Inc. and Kodiak Robotics, have also been using the tool, OpenAI said.
"We're just seeing very fast progress in the model's ability to solve coding and software engineering problems," said Josh Tobin, research lead on agents at OpenAI. "We see this as a particularly fast way for us to get to that agents vision."
Codex runs on a version of OpenAI's o3 AI reasoning model that is optimized for software engineering. The tool can take anywhere from one to 30 minutes to complete a task, depending on complexity.
OpenAI also said Codex was trained to identify and refuse requests aimed at the development of malicious software, a nod to concerns that bad actors could turn to more sophisticated coding agents for cyber attacks and other harmful uses. – Bloomberg


AI personal shoppers hunt down bargain buys

Sinar Daily

3 hours ago

NEW YORK – Internet giants are diving deeper into e-commerce with digital aides that know shoppers' preferences, let them virtually try on clothes, hunt for deals, and even place orders.
The rise of virtual personal shoppers stems from generative artificial intelligence (AI) being deployed in 'agents' that specialise in specific tasks and are granted autonomy to complete them independently.
'This is basically the next evolution of shopping experiences,' said CFRA Research analyst Angelo Zino.
Google last week unveiled shopping features built into a new 'AI Mode'. It can take a person's own photo and blend it with that of a skirt, shirt, or other piece of clothing spotted online, showing how it would look on them.
The AI adjusts the clothing size to fit, accounting for how fabrics drape, according to Google's Head of Advertising and Commerce, Vidhya Srinivasan.
Shoppers can then set the price they are willing to pay and leave the AI to tirelessly browse the internet for a deal, alerting the shopper when one is found and asking whether it should proceed with the purchase using Google's payment platform.
'They're taking on Amazon a little bit,' said Techsponential analyst Avi Greengart of Google.
The tool is also a way to monetise AI by increasing online traffic and opportunities to display ads, Greengart added.
The Silicon Valley tech titan did not respond to a query regarding whether it is sharing revenue from shopping transactions.
Bartering bots?
OpenAI added a shopping feature to ChatGPT earlier this year, enabling the chatbot to respond to requests with product suggestions, consumer reviews, and links to merchant websites.
Perplexity AI began allowing subscribers to pay for online purchases without leaving its app late last year.
In April, Amazon introduced a 'Buy for Me' mode to its Rufus digital assistant, enabling users to command it to make purchases on retailer websites outside Amazon's own platform.
Walmart's Head of Technology, Hari Vasudev, recently spoke about adding an AI agent to the retail behemoth's online shopping portal, while also working with partners to ensure their digital agents prioritise Walmart products.
Global payment networks Visa and Mastercard both announced in April that their systems had been modernised to enable payment transactions by digital agents.
'As AI agents start to take over the bulk of product discovery and the decision-making process, retailers must consider how to optimise for this new layer of AI shoppers,' said Elise Watson of Clarkston Consulting.
Retailers are likely to be left in the dark when it comes to what makes a product attractive to AI agents, according to Watson.
Knowing the customer
Zino does not expect AI shoppers to trigger an upheaval in the e-commerce industry, but he does see the technology benefiting Google and Meta.
Not only do the internet rivals possess vast amounts of data about their users, but they are also among the frontrunners in the AI race.
'They probably have more information on the consumer than anyone else out there,' Zino said of Google and Meta.
Technology firms' access to user data touches on the hot-button issue of online privacy and who should control personal information.
Google plans to refine consumer profiles based on search activity and promises that shoppers will need to authorise access to additional information, such as emails or app usage.
Trusting a chatbot with purchasing decisions may alarm some users, and while the technology may be in place, the legal and ethical framework is not yet fully developed.
'The agent economy is here,' said PSE Consulting Managing Director Chris Jones. 'The next phase of e-commerce will depend on whether we can trust machines to buy on our behalf.' – AFP

Hey chatbot, is this true? AI's answer: not really, say fact-checkers

Malay Mail

4 hours ago

WASHINGTON, June 2 – As misinformation exploded during India's four-day conflict with Pakistan, social media users turned to an AI chatbot for verification, only to encounter more falsehoods, underscoring its unreliability as a fact-checking tool.
With tech platforms reducing human fact-checkers, users are increasingly relying on AI-powered chatbots, including xAI's Grok, OpenAI's ChatGPT, and Google's Gemini, in search of reliable information.
'Hey @Grok, is this true?' has become a common query on Elon Musk's platform X, where the AI assistant is built in, reflecting the growing trend of seeking instant debunks on social media.
But the responses are often themselves riddled with misinformation.
Grok (now under renewed scrutiny for inserting 'white genocide,' a far-right conspiracy theory, into unrelated queries) wrongly identified old video footage from Sudan's Khartoum airport as a missile strike on Pakistan's Nur Khan airbase during the country's recent conflict with India.
Unrelated footage of a building on fire in Nepal was misidentified as 'likely' showing Pakistan's military response to Indian strikes.
'The growing reliance on Grok as a fact-checker comes as X and other major tech companies have scaled back investments in human fact-checkers,' McKenzie Sadeghi, a researcher with the disinformation watchdog NewsGuard, told AFP.
'Our research has repeatedly found that AI chatbots are not reliable sources for news and information, particularly when it comes to breaking news,' she warned.
'Fabricated'
NewsGuard's research found that 10 leading chatbots were prone to repeating falsehoods, including Russian disinformation narratives and false or misleading claims related to the recent Australian election.
In a recent study of eight AI search tools, the Tow Center for Digital Journalism at Columbia University found that chatbots were 'generally bad at declining to answer questions they couldn't answer accurately, offering incorrect or speculative answers instead.'
When AFP fact-checkers in Uruguay asked Gemini about an AI-generated image of a woman, it not only confirmed its authenticity but fabricated details about her identity and where the image was likely taken.
Grok recently labelled a purported video of a giant anaconda swimming in the Amazon River as 'genuine,' even citing credible-sounding scientific expeditions to support its false claim.
In reality, the video was AI-generated, AFP fact-checkers in Latin America reported, noting that many users cited Grok's assessment as evidence the clip was real.
Such findings have raised concerns as surveys show that online users are increasingly shifting from traditional search engines to AI chatbots for information gathering and verification.
The shift also comes as Meta announced earlier this year it was ending its third-party fact-checking program in the United States, turning over the task of debunking falsehoods to ordinary users under a model known as 'Community Notes,' popularized by X.
Researchers have repeatedly questioned the effectiveness of 'Community Notes' in combating falsehoods.
'Biased answers'
Human fact-checking has long been a flashpoint in a hyperpolarized political climate, particularly in the United States, where conservative advocates maintain it suppresses free speech and censors right-wing content, something professional fact-checkers vehemently reject.
AFP currently works in 26 languages with Facebook's fact-checking program, including in Asia, Latin America, and the European Union.
The quality and accuracy of AI chatbots can vary, depending on how they are trained and programmed, prompting concerns that their output may be subject to political influence or control.
Musk's xAI recently blamed an 'unauthorized modification' for causing Grok to generate unsolicited posts referencing 'white genocide' in South Africa.
When AI expert David Caswell asked Grok who might have modified its system prompt, the chatbot named Musk as the 'most likely' culprit.
Musk, the South African-born billionaire backer of President Donald Trump, has previously peddled the unfounded claim that South Africa's leaders were 'openly pushing for genocide' of white people.
'We have seen the way AI assistants can either fabricate results or give biased answers after human coders specifically change their instructions,' Angie Holan, director of the International Fact-Checking Network, told AFP.
'I am especially concerned about the way Grok has mishandled requests concerning very sensitive matters after receiving instructions to provide pre-authorized answers.' – AFP
