Anthropic announces Claude 4 Sonnet and Claude 4 Opus AI models, says they reason hard

India Today23-05-2025

Anthropic has launched two new AI models: Claude Opus 4 and Claude Sonnet 4. Anthropic says that these models are the best in the industry, and their highlight is their ability to reason. Along with that, Opus 4 and Sonnet 4 are both designed to improve coding and agent-like tasks. According to Anthropic, Claude Opus 4 is its most powerful model to date and is aimed at developers working on long and complex tasks. 'Claude Opus 4 is the world's best coding model, with sustained performance on complex, long-running tasks and agent workflows,' Anthropic writes on its blog post.advertisementClaude Sonnet 4, meanwhile, is described as a more practical, efficient upgrade from its predecessor, Claude Sonnet 3.7, and is now available to free-tier users. Opus 4 will only be available to paid subscribers of Claude.One of the key highlights, according to Anthropic, is Claude Opus 4's strong performance on coding benchmarks. The AI company claims that it scored 72.5 per cent on SWE-bench and 43.2 per cent on Terminal-bench. Which basically means that the model can reportedly work for several hours at a time without dropping performance, making it suitable for projects that require sustained attention.
Claude Sonnet 4 also shows improvement over previous versions, scoring 72.7 per cent on SWE-bench. While it doesn't match Opus 4 in overall capability, Anthropic says it strikes a better balance between speed and accuracy, which makes it suitable for broader, more everyday tasks.advertisementBoth models come with what Anthropic calls 'extended thinking' and tool use. This means they can pause reasoning, use tools like web search or code execution, and then resume their thought process. Tool use can now happen in parallel as well, which helps with more complex workflows.The models also introduce new memory features. If given access to local files, they can extract key facts and save them for future use. Anthropic says this helps the model build better long-terms memory and improve performance on tasks that require continuity.Anthropic has also announced four new API features, which are also rolling out. This includes a code execution tool, a connector for Anthropic's Multi-Component Programs (MCP), a Files API, and prompt caching for up to one hour. These updates aim to make it easier for developers to build AI agents that can take on more complex tasks.
Anthropic claims that Claude Opus 4 shows strong memory performance, particularly in agent-like settings. Anthropic says it gave the AI model file access during a game of Pokmon, and it was able to create a 'Navigation Guide' and maintain awareness of past actions. It also reduces the chance of using shortcuts or loopholes to complete tasks, something previous models were more prone to doing.To help users better understand how the models arrive at conclusions, Anthropic has added a new feature called 'thinking summaries.' These are short overviews of the model's reasoning process, generated by a smaller AI model. Full chains of thought are still available upon request through a Developer Mode.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Google chief Sundar Pichai explains why AI can't replace human coders just yet

Mint

16 hours ago

Mint

Google chief Sundar Pichai explains why AI can't replace human coders just yet

Artificial intelligence–powered tools like ChatGPT and Gemini are improving rapidly, and companies are now developing new agentic tools that could potentially replace humans in certain roles. The trend began earlier this year when OpenAI launched a Deep Research agent, which it claimed could replace entry-level research assistants. This was followed by OpenAI's Codex software engineering agent, and later, Google unveiled its own coding agent, Jules, at I/O 2025. Despite the arrival of these coding agents and a stark warning from Anthropic CEO Dario Amodei that 50% of all white-collar entry-level jobs could be eliminated within the next one to five years, Google CEO Sundar Pichai remains optimistic. During a recent podcast with Lex Fridman, Pichai said that while 30% of all code written at Google now uses AI help, the new technology is freeeing up humans do more and Google plans to hire more software engineers in the near to mid term. 'I think a few things. Looking at Google, we've given various stats around 30% of code now uses AI-generated solutions or whatever it is. But the most important metric, and we carefully measure this, how much has our engineering velocity increased as a company due to AI. And it's tough to measure, and we rigorously try to measure it. And our estimates are that number is now at 10%.' Pichai told the podcaster. 'Now, across the company, we've accomplished a 10% engineering velocity increase using AI. But we plan to hire more engineers next year. Because the opportunity space of what we can do is expanding too. And so, I think hopefully, at least in the near to mid-term, for many engineers, it frees up more and more of the, even in engineering and coding, there are aspects which are so much fun. You're designing, you're architecting, you're solving a problem. There's a lot of grunt work, which all goes hand in hand.' the Alphabet CEO added.

‘Frees up time to…': Sundar Pichai explains why AI won't replace human coders yet

Mint

19 hours ago

Mint

‘Frees up time to…': Sundar Pichai explains why AI won't replace human coders yet

Meta in talks for Scale AI investment that could top $10 billion

Time of India

21 hours ago

Time of India

Meta in talks for Scale AI investment that could top $10 billion

Meta Platforms Inc. is in talks to make a multibillion-dollar investment into artificial intelligence startup Scale AI , according to people familiar with the matter. The financing could exceed $10 billion in value, some of the people said, making it one of the largest private company funding events of all time. The terms of the deal are not finalized and could still change, according to the people, who asked not to be identified discussing private information. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Dukung Orang Terkasih Menghadapi Limfoma: Mulai Di Sini Limfoma Pelajari Undo A representative for Scale did not immediately respond to requests for comment. Meta declined to comment. Scale AI, whose customers include Microsoft Corp. and OpenAI, provides data labeling services to help companies train machine-learning models and has become a key beneficiary of the generative AI boom. The startup was last valued at about $14 billion in 2024, in a funding round that included backing from Meta and Microsoft. Earlier this year, Bloomberg reported that Scale was in talks for a tender offer that would value it at $25 billion. Live Events This would be Meta's biggest ever external AI investment, and a rare move for the company. The social media giant has before now mostly depended on its in-house research, plus a more open development strategy, to make improvements in its AI technology. Meanwhile, Big Tech peers have invested heavily: Microsoft has put more than $13 billion into OpenAI while both Inc. and Alphabet Inc. have put billions into rival Anthropic. Discover the stories of your interest Blockchain 5 Stories Cyber-safety 7 Stories Fintech 9 Stories E-comm 9 Stories ML 8 Stories Edtech 6 Stories Part of those companies' investments have been through credits to use their computing power. Meta doesn't have a cloud business, and it's unclear what format Meta's investment will take. Chief Executive Officer Mark Zuckerberg has made AI Meta's top priority, and said in January that the company would spend as much as $65 billion on related projects this year. The company's push includes an effort to make Llama the industry standard worldwide. Meta's AI chatbot — already available on Facebook, Instagram and WhatsApp — is used by 1 billion people per month. Scale, co-founded in 2016 by CEO Alexandr Wang, has been growing quickly: The startup generated revenue of $870 million last year and expects sales to more than double to $2 billion in 2025, Bloomberg previously reported. Scale plays a key role in making AI data available for companies. Because AI is only as good as the data that goes into it, Scale uses scads of contract workers to tidy up and tag images, text and other data that can then be used for AI training. Scale and Meta share an interest in defense tech. Last week, Meta announced a new partnership with defense contractor Anduril Industries Inc. to develop products for the US military, including an AI-powered helmet with virtual and augmented reality features. Meta has also granted approval for US government agencies and defense contractors to use its AI models. The company is already partnering with Scale on a program called Defense Llama — a version of Meta's Llama large language model intended for military use. Scale has increasingly been working with the US government to develop AI for defense purposes. Earlier this year the startup said it won a contract with the Defense Department to work on AI agent technology. The company called the contract 'a significant milestone in military advancement.'