Anthropic CEO claims AI models hallucinate less than humans

Yahoo23-05-2025

Anthropic CEO Dario Amodei believes today's AI models hallucinate, or make things up and present them as if they're true, at a lower rate than humans do, he said during a press briefing at Anthropic's first developer event, Code with Claude, in San Francisco on Thursday.
Amodei said all this in the midst of a larger point he was making: that AI hallucinations are not a limitation on Anthropic's path to AGI — AI systems with human-level intelligence or better.
"It really depends how you measure it, but I suspect that AI models probably hallucinate less than humans, but they hallucinate in more surprising ways," Amodei said, responding to TechCrunch's question.
Anthropic's CEO is one of the most bullish leaders in the industry on the prospect of AI models achieving AGI. In a widely circulated paper he wrote last year, Amodei said he believed AGI could arrive as soon as 2026. During Thursday's press briefing, the Anthropic CEO said he was seeing steady progress to that end, noting that "the water is rising everywhere."
"Everyone's always looking for these hard blocks on what [AI] can do," said Amodei. "They're nowhere to be seen. There's no such thing."
Other AI leaders believe hallucination presents a large obstacle to achieving AGI. Earlier this week, Google DeepMind CEO Demis Hassabis said today's AI models have too many "holes," and get too many obvious questions wrong. For example, earlier this month, a lawyer representing Anthropic was forced to apologize in court after they used Claude to create citations in a court filing, and the AI chatbot hallucinated and got names and titles wrong.
It's difficult to verify Amodei's claim, largely because most hallucination benchmarks pit AI models against each other; they don't compare models to humans. Certain techniques seem to be helping lower hallucination rates, such as giving AI models access to web search. Separately, some AI models, such as OpenAI's GPT-4.5, have notably lower hallucination rates on benchmarks compared to early generations of systems.
However, there's also evidence to suggest hallucinations are actually getting worse in advanced reasoning AI models. OpenAI's o3 and o4-mini models have higher hallucination rates than OpenAI's previous-gen reasoning models, and the company doesn't really understand why.
Later in the press briefing, Amodei pointed out that TV broadcasters, politicians, and humans in all types of professions make mistakes all the time. The fact that AI makes mistakes too is not a knock on its intelligence, according to Amodei. However, Anthropic's CEO acknowledged the confidence with which AI models present untrue things as facts might be a problem.
In fact, Anthropic has done a fair amount of research on the tendency for AI models to deceive humans, a problem that seemed especially prevalent in the company's recently launched Claude Opus 4. Apollo Research, a safety institute given early access to test the AI model, found that an early version of Claude Opus 4 exhibited a high tendency to scheme against humans and deceive them. Apollo went as far as to suggest Anthropic shouldn't have released that early model. Anthropic said it came up with some mitigations that appeared to address the issues Apollo raised.
Amodei's comments suggest that Anthropic may consider an AI model to be AGI, or equal to human-level intelligence, even if it still hallucinates. An AI that hallucinates may fall short of AGI by many people's definition, though.
This article originally appeared on TechCrunch at https://techcrunch.com/2025/05/22/anthropic-ceo-claims-ai-models-hallucinate-less-than-humans/

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Chief AI Scientist At Mark Zuckerberg's Meta Says 'No Way' Scaling ChatGPT-Like Models Is Going To Lead To Human-Level AI

Yahoo

40 minutes ago

Yahoo

Chief AI Scientist At Mark Zuckerberg's Meta Says 'No Way' Scaling ChatGPT-Like Models Is Going To Lead To Human-Level AI

Meta Platforms, Inc.'s (NASDAQ:META) chief AI scientist, Yann LeCun, says the tech industry won't close the gap to human-level intelligence by scaling today's large language models and piling on more parameters. What Happened: "We are not going to get to human-level AI by just scaling up LLMs. This is just not going to happen. There's no way — absolutely no way," LeCun told host Alex Kantrowitz on the Big Technology podcast in March. He dismissed bullish two-year timelines from "more adventurous colleagues" as "complete BS." Trending: Maker of the $60,000 foldable home has 3 factory buildings, 600+ houses built, and big plans to solve housing — In a clip of the podcast which was resurfaced on YouTube last week, LeCun likened current chatbots to "a system with a gigantic memory and retrieval ability, not a system that can invent solutions to new problems," adding that even if the models can answer most routine questions, "it's not a Ph.D. you have next to you." Instead of reasoning, he said, today's systems "pattern-match" the next word. LeCun contends the best path forward is collaborative. According to a report by Business Insider, at the AI Action Summit in Paris, which took place in February, he urged governments to contribute anonymized data to a larger open-source It Matters: LeCun has long doubted that OpenAI will win the race to artificial general intelligence (AGI), a stance he first voiced in December 2023. Last week, he pointed Elon Musk toward a new FAIR study on "Contextual Positional Encoding," telling the xAI founder it could boost Grok and then amplified the paper by sharing Meta researcher Jason Weston's explanatory thread on X. The exchange unfolded amid LeCun's running feud with Musk. After Musk posted xAI job openings on Monday, LeCun quipped that applicants should expect a boss who insists their project "will be solved next year." He later applauded Musk's engineering triumphs in cars, rockets, and satellites while slamming the billionaire's politics, conspiracy theories, and habitual hype. Read Next: Hasbro, MGM, and Skechers trust this AI marketing firm — Invest before it's too late. 'Scrolling To UBI' — Deloitte's #1 fastest-growing software company allows users to earn money on their phones. You can invest today for just $0.30/share with a $1000 minimum. Photo Courtesy: Tapati Rinchumrus on Up Next: Transform your trading with Benzinga Edge's one-of-a-kind market trade ideas and tools. Click now to access unique insights that can set you ahead in today's competitive market. Get the latest stock analysis from Benzinga? This article Chief AI Scientist At Mark Zuckerberg's Meta Says 'No Way' Scaling ChatGPT-Like Models Is Going To Lead To Human-Level AI originally appeared on Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

HIVE Digital Capacity Crosses 10 EH/s in May, Aims to More Than Double That by Year-End

Yahoo

an hour ago

Yahoo

HIVE Digital Capacity Crosses 10 EH/s in May, Aims to More Than Double That by Year-End

Bitcoin miner HIVE Digital Technologies (HIVE) has surpassed 10 exahash per second (EH/s) in hashrate capacity, a 58% increase from April, driven by the launch of a 100-megawatt hydro-powered site in Paraguay. The company said in a press release on Friday that it's on track to reach 25 EH/s by the end of 2025. The firm mined 139 bitcoin in May, or an average rate of 4.5 BTC per day. Peak capacity hit 10.4 EH/s while average hashrate for the month stood at 8.5 EH/s. HIVE said its fleet efficiency remained steady at around 20 joules per terahash (J/TH), and its network share now exceeds 1% of global Bitcoin mining power. The new facility in Paraguay reflects a broader trend in the mining industry: the race to deploy next-generation ASIC miners rapidly and at scale in regions with abundant renewable power. Co-founder Frank Holmes emphasized the company's speed and flexibility, pointing to its Buzz HPC division, which supports AI cloud infrastructure alongside Bitcoin mining. CEO Aydin Kilic said the company's goal for the summer is 18 EH/s, and that fleet upgrades should allow for a daily BTC output of over 12 by the fourth quarter — potentially at a production cost below $50,000 per coin. HIVE operates facilities in Canada, Sweden and Paraguay, powered entirely by hydroelectricity. The company was the first publicly listed crypto miner on the TSX Venture Exchange in 2017. HIVE shares are higher by 13% in New York trade on Friday as the mining sector rallies alongside bitcoin's gain to above $105,000.

Oncoscope Officially Launches, Ushering in a New Era of Real Time Oncology Intelligence

Associated Press

an hour ago

Associated Press

Oncoscope Officially Launches, Ushering in a New Era of Real Time Oncology Intelligence

06/06/2025, Miami, Florida // PRODIGY: Feature Story // Anna Forsythe, founder of Oncoscope-AI (source: Oncoscope-AI) Oncoscope-AI, a revolutionary oncology intelligence platform, has officially launched following a successful beta phase and over a year of strategic development that involved extensive conversations with practicing oncologists. The platform, which delivers real-time, human-curated cancer insights enhanced by artificial intelligence, is now live and available free of charge to verified healthcare professionals worldwide. Founded by Anna Forsythe, a pharmacist, health economist, and seasoned pharmaceutical executive, Oncoscope addresses a critical gap in oncology care. It gives clinicians instant access to the most current treatment data, FDA approvals, and guideline-aligned information, consolidated into one user-friendly platform. 'Doctors do not need more data. They need the right information, at the right time, in a format they can use to make better decisions for their patients,' said Forsythe. 'Oncoscope provides that clarity. It is a living library of oncology, curated by experts and built to save lives.' Unlike generic AI tools or static databases, Oncoscope uses trained AI to scan thousands of oncology publications and filters them through a rigorous, evidence-based framework. Each entry is cross-referenced with clinical guidelines and regulatory approvals to ensure usability and relevance. All of the results are carefully scrutinized by a team of experienced researchers. Currently, the platform supports breast and lung cancer, with prostate, bladder, colon, and rectal modules rolling out in the coming months. The process is intuitive. Physicians answer three clinical questions—cancer stage, genetic markers, and prior treatments—and receive a personalized, actionable summary. Each recommended article includes survival data, progression insights, treatment efficacy, and toxicity, extracted across 32 key clinical parameters. 'The result is something physicians can actually use in the moment,' said Forsythe. 'It takes three clicks to go from a patient in the room to the most up-to-date evidence in the field.' Access to Oncoscope is free for verified healthcare professionals, including physicians, nurses, pharmacists, genetic counselors, and physician assistants. Non-verified users, such as those in finance or consulting, can purchase limited access at a monthly rate, restricted to a single cancer type. This structure reflects the company's commitment to empowering front-line clinicians with better tools—without barriers. Forsythe, who previously founded and sold a successful health economics company serving global pharmaceutical clients, brings a rare combination of clinical, technical, and business expertise to this venture. She sees Oncoscope not only as a tool, but as a mission. 'This platform was born from both professional insight and personal urgency,' she said. 'Too many patients are still receiving outdated treatments, simply because their doctors do not have time to stay current. I realized I had the knowledge, the team, and the experience to fix that.' With a lean team, strategic vision, and a rapidly growing user base, Oncoscope is poised to become a trusted global resource in cancer treatment. 'We are not just a tech company,' said Forsythe. 'We are part of the oncology ecosystem. And we are here to help doctors deliver the best care possible.' For more information, visit Media Contact: Name - Anna Forsythe Email - [email protected] Source published by Submit Press Release >> Oncoscope Officially Launches, Ushering in a New Era of Real Time Oncology Intelligence

Anthropic CEO claims AI models hallucinate less than humans

Hashtags

Try Our AI Features

Comments

Related Articles

Chief AI Scientist At Mark Zuckerberg's Meta Says 'No Way' Scaling ChatGPT-Like Models Is Going To Lead To Human-Level AI

HIVE Digital Capacity Crosses 10 EH/s in May, Aims to More Than Double That by Year-End

Oncoscope Officially Launches, Ushering in a New Era of Real Time Oncology Intelligence

Get Started Now: Download the App