Mirror Mirror, on the wall, who hallucinates the most of all?: Anthropic's CEO claims humans hallucinate more than AI, boasting the new model's factual reliability.

Live Events
CEO Dario Amodei , speaking at the VivaTech 2025 in Paris and the 'Inaugural Code with Claude' developer day, claimed that AI can now outperform human beings in terms of factual accuracy in structured scenarios. He asserts in the aforementioned major tech events of this month that modern AI models, including the newly released Claude 4 series , may hallucinate at a lesser rate than most humans when answering factual and structured questions.In the context of AI, hallucination refers to when AI tools such as ChatGPT, Gemini, Copilot, or even Claude misinterpret commands, data, and context. Upon misinterpreting, it creates gaps in knowledge, wherein the AI tool begins to fill those gaps with assumptions, which aren't always factual or even real at times. Simply put, it is the generation of fabricated information.However, with recent advancements, Amodei plants a suggestion that the situation has turned the other way around, although mostly so in conditions that can be deemed 'controlled.'During Amodei's keynote at VivaTech, he cited Anthropic 's internal testing, where they demonstrated Claude 3.5's factual accuracy using structured factual quizzes in competition with human participants. The test garnered results that proved a notable shift in reliability when it comes to factual precision, at least so in straightforward question-answer tasks.He further insists on his stance, reportedly at the developer-focused 'Code with Claude' event, where the Claude Opus 4 and Claude Sonnet 4 models were unveiled, that factual accuracy in AI models depends severely upon the prompt design, context, and domain-specific application. Particularly in high-stakes environments like legal filings or healthcare. He stressed this statement whilst acknowledging the recent legal dispute involving Claude's confabulations.The CEO also promptly admits to not having the 'hallucinations' completely eradicated and understands that the model still remains vulnerable to error but can be used with optimum accuracy with the right information fed to the model.While modern AI models like the new Claude 4 series are steadily advancing toward factual precision, especially in structured tasks, their reliability still depends on proper and careful use. As Amodei suggested, prompt design and domain context remain critical. In this ongoing competition between human intelligence and artificial intelligence, one thing is certain: it isn't merely us who hold the key to the answers; rather, we share the test with the machines.

Hashtags

#Claude

#Claude4

#VivaTech2025

#InauguralCodewithClaude

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Bengaluru ranks 14 in global startups, fifth in AI and big data

New Indian Express

29 minutes ago

New Indian Express

Bengaluru ranks 14 in global startups, fifth in AI and big data

BENGALURU: Bengaluru has jumped seven spots to rank 14 in the Global Startup Ecosystem Report (GSER) 2025, up from 21 last year. The city ranked fifth globally in AI & Big Data. The GSER 2025, published by Startup Genome and released at VivaTech 2025 in Paris, evaluates ecosystems across key parameters, performance, funding, market reach, talent and experience, and knowledge along with a new focus on AI-native capabilities. At VivaTech 2025, Karnataka IT-BT Minister Priyank Kharge spoke at a high-level panel discussion on AI disruption and startup ecosystem futures. 'This ranking is not just a number; it reflects the structural resilience and readiness of Karnataka's innovation economy,' Kharge stated. He detailed Karnataka's multi-pronged strategy; including Innoverse open innovation platform, Beyond Bengaluru regional startup mission, and Nipuna Karnataka, a skilling initiative targeting over a million professionals. The state is also fostering enterprise-startup collaboration via its growing Global Capability Center (GCC) network and supporting Deep Tech with dedicated funding for AI, biotech and robotics. Bengaluru now stands shoulder-to-shoulder with global hubs like Paris (rank 12), Philadelphia (13), and Seattle (15), marking a significant shift in global innovation trends and placing Karnataka firmly on the global Deep Tech map.

Google, Scale AI's largest customer, plans split after Meta deal

Time of India

32 minutes ago

Time of India

Google, Scale AI's largest customer, plans split after Meta deal

Alphabet's Google, the largest customer of Scale AI, plans to cut ties with Scale after news broke that rival Meta is taking a 49% stake in the AI data-labeling startup, five sources familiar with the matter told Reuters. Google had planned to pay Scale AI about $200 million this year for the human-labeled training data that is crucial for developing technology, including the sophisticated AI models that power Gemini, its ChatGPT competitor, one of the sources said. The search giant already held conversations with several of Scale AI's rivals this week as it seeks to shift away much of that workload, sources added. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like 最強のヒーローチームを編成する力はありますか? レイドシャドウレジェンド今すぐインストール Undo Scale's loss of significant business comes as Meta takes a big stake in the company, valuing it at $29 billion. Scale was worth $14 billion before the deal. Scale AI intends to keep its business running while its CEO, Alexandr Wang, along with a few employees, move over to Meta. Since its core business is concentrated around a few customers, it could suffer greatly if it loses key customers like Google. In a statement, a Scale AI spokesperson said its business, which spans work with major companies and governments, remains strong, as it is committed to protecting customer data. The company declined to comment on specifics with Google. Live Events Scale AI raked in $870 million in revenue in 2024, and Google spent some $150 million on Scale AI's services last year, sources said. Other major tech companies that are customers of Scale's, including Microsoft, are also backing away. Elon Musk's xAI is also looking to exit, one of the sources said. OpenAI decided to pull back from Scale several months ago, according to sources familiar with the matter, though it spends far less money than Google. OpenAI's CFO that the company will continue to work with Scale AI, as one of its many data vendors. Discover the stories of your interest Blockchain 5 Stories Cyber-safety 7 Stories Fintech 9 Stories E-comm 9 Stories ML 8 Stories Edtech 6 Stories Companies that compete with Meta in developing cutting-edge AI models are concerned that doing business with Scale could expose their research priorities and road map to a rival, five sources said. By contracting with Scale AI, customers often share proprietary data as well as prototype products for which Scale's workers are providing data-labeling services. With Meta now taking a 49% stake, AI companies are concerned that one of their chief rivals could gain knowledge about their business strategy and technical blueprints. Google, Microsoft and OpenAI declined to comment. xAI did not respond to a request for comment. Rivals see openings The bulk of Scale AI's revenue comes from charging generative AI model makers for providing access to a network of human trainers with specialized knowledge - from historians to scientists, some with doctorate degrees. The humans annotate complex datasets that are used to "post-train" AI models, and as AI models have become smarter, the demand for the sophisticated human-provided examples has surged, and one annotation could cost as much as $100. Scale also does data-labeling for enterprises like self-driving car companies and the US government, which are likely to stay, according to the sources. But its biggest money-maker is in partnering with generative AI model makers, the sources said. Google had already sought to diversify its data service providers for more than a year, three of the sources said. But Meta's moves this week have led Google to seek to move off Scale AI on all its key contracts, the sources added. Because of the way data-labeling contracts are structured, that process could happen quickly, two sources said. This will provide an opening for Scale AI's rivals to jump in. "The Meta-Scale deal marks a turning point," said Jonathan Siddharth, CEO of Turing, a Scale AI competitor. "Leading AI labs are realizing neutrality is no longer optional, it's essential." Labelbox, another competitor, will "probably generate hundreds of millions of new revenue" by the end of the year from customers fleeing Scale, its CEO, Manu Sharma, told Reuters. Handshake, a competitor focusing on building a network of PhDs and experts, saw a surge of workload from top AI labs that compete with Meta. "Our demand has tripled overnight after the news," said Garrett Lord, CEO at Handshake. Many AI labs now want to hire in-house data-labelers, which allows their data to remain secure, said Brendan Foody, CEO of Mercor, a startup that in addition to competing directly with Scale AI also builds technology around being able to recruit and vet candidates in an automated way, enabling AI labs to scale up their data labeling operations quickly. Founded in 2016, Scale AI provides vast amounts of labeled data or curated training data, which is crucial for developing sophisticated tools such as OpenAI's ChatGPT. The Meta deal will be a boon for Scale AI's investors including Accel and Index Ventures, as well as its current and former employees. As part of the deal, Scale AI's CEO, Wang, will take a top position leading Meta's AI efforts. Meta is fighting the perception that it may have fallen behind in the AI race after its initial set of Llama 4 large language models released in April fell short of performance expectations.

Meta's $14.8 billion Scale AI deal latest test of AI partnerships

Time of India

40 minutes ago

Time of India

Meta's $14.8 billion Scale AI deal latest test of AI partnerships

Facebook owner Meta's $14.8 billion investment in Scale AI and hiring of the data-labeling startup's CEO will test how the Trump administration views so-called acquihire deals , which some have criticized as an attempt to evade regulatory scrutiny. The deal, announced on Thursday, was Meta's second-largest investment to date. It gives the owner of Facebook a 49% nonvoting stake in Scale AI, which uses gig workers to manually label data and includes among its customers Meta competitors Microsoft and ChatGPT creator OpenAI. Unlike an acquisition or a transaction that would give Meta a controlling stake, the deal does not require a review by US antitrust regulators. However, they could probe the deal if they believe it was structured to avoid those requirements or harm competition. The deal appeared to be structured to avoid potential pitfalls, such as cutting off competitors' access to Scale's services or giving Meta an inside view into rivals' operations - though Reuters exclusively reported on Friday that Alphabet's Google has decided to sever ties with Scale in light of Meta's stake, and other customers are looking at taking a step back. In a statement, a Scale AI spokesperson said its business, which spans work with major companies and governments, remains strong, as it is committed to protecting customer data. The company declined to comment on specifics with Google. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Elegant New Scooters For Seniors In 2024: The Prices May Surprise You Mobility Scooter | Search Ads Learn More Undo Alexandr Wang, Scale's 28-year-old CEO who is coming to Meta as part of the deal, will remain on Scale's board but will have appropriate restrictions placed around his access to information, two sources familiar with the move confirmed. Large tech companies likely perceive the regulatory environment for AI partnerships as easier to navigate under President Donald Trump than under former President Joe Biden, said William Kovacic, director of the competition law center at George Washington University. Trump's antitrust enforcers have said they do not want to regulate how AI develops, but have also displayed a suspicion of large tech platforms, he added. Live Events "That would lead me to think they will keep looking carefully at what the firms do. It does not necessarily dictate that they will intervene in a way that would discourage the relationships," Kovacic said. Federal Trade Commission probes into past "aquihire" deals appear to be at a standstill. Under the Biden administration, the FTC opened inquiries into Amazon's deal to hire top executives and researchers from AI startup Adept, and Microsoft's $650 million deal with Inflection AI. The latter allowed Microsoft to use Inflection's models and hire most of the startup's staff, including its co-founders. Amazon's deal closed without further action from the regulator, a source familiar with the matter confirmed. And, more than a year after its initial inquiry, the FTC has so far taken no enforcement action against Microsoft over Inflection, though a larger probe over practices at the software giant is ongoing. Discover the stories of your interest Blockchain 5 Stories Cyber-safety 7 Stories Fintech 9 Stories E-comm 9 Stories ML 8 Stories Edtech 6 Stories A spokesperson for the FTC declined to comment on Friday. David Olson, a professor who teaches antitrust law at Boston College Law School, said it was smart of Meta to take a minority nonvoting stake. "I think that does give them a lot of protection if someone comes after them," he said, adding that it was still possible that the FTC would want to review the agreement. The Meta deal has its skeptics. US Senator Elizabeth Warren, a Democrat from Massachusetts who is probing AI partnerships involving Microsoft and Google, said Meta's investment should be scrutinized. "Meta can call this deal whatever it wants - but if it violates federal law because it unlawfully squashes competition or makes it easier for Meta to illegally dominate, antitrust enforcers should investigate and block it," she said in a statement on Friday. While Meta faces its own monopoly lawsuit by the FTC, it remains to be seen whether the agency will have any questions about its Scale investment. The US Department of Justice's antitrust division, led by former JD Vance adviser Gail Slater, recently started looking into whether Google's partnership with chatbot creator was designed to evade antitrust review, Bloomberg News reported. The DOJ is separately seeking to make Google give it advance notice of new AI investments as part of a proposal to curb the company's dominance in online search.