logo
Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry

Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry

Yahoo4 hours ago

Researchers at Apple have released an eyebrow-raising paper that throws cold water on the "reasoning" capabilities of the latest, most powerful large language models.
In the paper, a team of machine learning experts makes the case that the AI industry is grossly overstating the ability of its top AI models, including OpenAI's o3, Anthropic's Claude 3.7, and Google's Gemini.
In particular, the researchers assail the claims of companies like OpenAI that their most advanced models can now "reason" — a supposed capability that the Sam Altman-led company has increasingly leaned on over the past year for marketing purposes — which the Apple team characterizes as merely an "illusion of thinking."
It's a particularly noteworthy finding, considering Apple has been accused of falling far behind the competition in the AI space. The company has chosen a far more careful path to integrating the tech in its consumer-facing products — with some seriously mixed results so far.
In theory, reasoning models break down user prompts into pieces and use sequential "chain of thought" steps to arrive at their answers. But now, Apple's own top minds are questioning whether frontier AI models simply aren't as good at "thinking" as they're being made out to be.
"While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scaling properties, and limitations remain insufficiently understood," the team wrote in its paper.
The authors — who include Samy Bengio, the director of Artificial Intelligence and Machine Learning Research at the software and hardware giant — argue that the existing approach to benchmarking "often suffers from data contamination and does not provide insights into the reasoning traces' structure and quality."
By using "controllable puzzle environments," the team estimated the AI models' ability to "think" — and made a seemingly damning discovery.
"Through extensive experimentation across diverse puzzles, we show that frontier [large reasoning models] face a complete accuracy collapse beyond certain complexities," they wrote.
Thanks to a "counter-intuitive scaling limit," the AIs' reasoning abilities "declines despite having an adequate token budget."
Put simply, even with sufficient training, the models are struggling with problem beyond a certain threshold of complexity — the result of "an 'overthinking' phenomenon," in the paper's phrasing.
The finding is reminiscent of a broader trend. Benchmarks have shown that the latest generation of reasoning models is more prone to hallucinating, not less, indicating the tech may now be heading in the wrong direction in a key way.
Exactly how reasoning models choose which path to take remains surprisingly murky, the Apple researchers found.
"We found that LRMs have limitations in exact computation," the team concluded in its paper. "They fail to use explicit algorithms and reason inconsistently across puzzles."
The researchers claim their findings raise "crucial questions" about the current crop of AI models' "true reasoning capabilities," undercutting a much-hyped new avenue in the burgeoning industry.
That's despite tens of billions of dollars being poured into the tech's development, with the likes of OpenAI, Google, and Meta, constructing enormous data centers to run increasingly power-hungry AI models.
Could the Apple researchers' finding be yet another canary in the coalmine, suggesting the tech has "hit a wall"?
Or is the company trying to hedge its bets, calling out its outperforming competition as it lags behind, as some have suggested?
It's certainly a surprising conclusion, considering Apple's precarious positioning in the AI industry: at the same time that its researchers are trashing the tech's current trajectory, it's promised a suite of Apple Intelligence tools for its devices like the iPhone and MacBook.
"These insights challenge prevailing assumptions about LRM capabilities and suggest that current approaches may be encountering fundamental barriers to generalizable reasoning," the paper reads.
More on AI models: Car Dealerships Are Replacing Phone Staff With AI Voice Agents

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Amazon wants to become a global marketplace for AI
Amazon wants to become a global marketplace for AI

Yahoo

time13 minutes ago

  • Yahoo

Amazon wants to become a global marketplace for AI

Amazon Web Services isn't betting on one large language model (LLM) winning the artificial intelligence race. Instead, it's offering customers a buffet of models to choose from. AWS, the cloud computing arm of Amazon (AMZN), aims to become the go-to infrastructure layer for the AI economy, regardless of which model wins out. By making customer choice a defining principle, AWS hopes to win out against rivals that have aligned closely with specific LLM providers — notably Microsoft (MSFT), which partnered with ChatGPT creator OpenAI ( 'We don't think that there's going to be one model to rule them all,' Dave Brown, vice president of compute and networking at AWS, told Yahoo Finance. The model-neutral approach is embedded into Amazon Bedrock, a service that allows AWS customers to build their own applications using a wide range of models, with more than 100 to choose from. Brown added that after Chinese startup DeepSeek surprised the world, AWS had a fully managed version of the disruptive model available on Bedrock within a week. Two years after its launch, Bedrock is now the fastest-growing service offered by AWS, which accounted for over 18% of Amazon's total revenue in the first quarter. It's why Amazon CEO Andy Jassy sees Bedrock as a core part of the company's AI growth strategy. But to understand the competitive advantage AWS hopes to offer with Bedrock, you have to go back to its origin story. Bedrock dates back to a six-page internal memo that Atul Deo, AWS's director of product management, wrote in 2020. Before OpenAI's ChatGPT launched in 2022 and made 'generative AI' a household term, Deo pitched a service that could generate code from plain English prompts using large language models. But Jassy, the head of AWS at the time, didn't buy it. 'His initial reaction was, 'This seems almost like a pipe dream,'' Deo said. He added that while a tool that makes coding easy sounds obvious now, the technology was 'still not quite there.' When that project, initially known as Code Whisperer, launched in 2023, the team realized they could offer the service for a broader set of use cases, giving customers a choice of different models with 'generic capabilities' that 'could be used as a foundation to build a lot of interesting applications,' according to Deo. Deo noted that the team steered away from doubling down on its own model after it recognized a pattern of customers wanting choice in other AWS services. This led to AWS becoming the first provider to offer a range of different models to customers. With this foundational approach in mind, Amazon renamed the project Bedrock. To be sure, the model-agnostic approach has risks, and many analysts don't consider Amazon to be leading the AI race, even though it has ramped up its AI spending. If there is ultimately one model to rule them all, similar to how Google came to dominate search, Amazon could risk further falling behind. At the beginning of the year, Amazon and its peers Meta (META), Microsoft, and Google parent Alphabet (GOOG) expected to spend $325 billion combined, mostly on AI infrastructure. To keep pace, Amazon has hedged its bets with its own technology and one LLM provider in particular: Anthropic. In November 2024, AWS doubled its investment in Anthropic to $8 billion in a deal that requires Anthropic to train its large language model, Claude, using only AWS's chips. (For comparison, Microsoft has invested over $13 billion into OpenAI.) The $8 billion deal allows Amazon to prove out its AI training infrastructure and deepen ties with one LLM provider while continuing to offer customers a wide selection of models on Bedrock. 'I mean, this is cloud selling 101, right?' said Dan Rosenthal, head of go-to-market partnerships at Anthropic. 'There are some cases where it's been very clear that a customer wants to use a different model on Bedrock for something that we just frankly don't focus on, and that's great. We want to win where we have a right to win.' Amazon also launched its own family of foundational models, called Nova, at the end of 2024, two years after the launch of ChatGPT. But competition and expectations remain high: Revenue at AWS increased 16.9% to $29.27 billion in Q1, marking the third time in a row it missed analyst estimates despite double-digit growth. The Anthropic partnership also underscores a bigger competition AWS may be fighting with chipmakers, including Nvidia (NVDA), which recently staged a $1 trillion rally in just two months after an earnings print that eased investor concerns about chip export controls. While Amazon is an Nvidia customer, it also produces highly effective and more affordable AI chips based on power consumed (known as 'price performance'). On Bedrock, AWS lets clients choose whether to use its own CPUs and GPUs or chips from competitors like Intel (INTC), AMD (AMD), and Nvidia. 'We're able to work with the model providers to really optimize the model for the hardware that it runs,' Brown said. 'There's no change the customer has to make.' Customers not only have a choice of model but also a choice of which infrastructure the model should run and train on. This helps AWS compete on price — a key battleground with Nvidia, which offers the most expensive chips on the market. This 'coopetition' dynamic could position Amazon to take market share from Nvidia if it can prove its own chips can do the job for a lower sticker price. It's a bet that Amazon is willing to spend on, with capital expenditures expected to hit $100 billion in 2025, up from $83 billion last year. While AWS doesn't break out its costs for AI, CEO Andy Jassy said on an earnings call in February that the 'vast majority of that capex spend is on AI for AWS.' In an April letter to shareholders, Jassy noted that 'AI revenue is growing at triple-digit YoY percentages and represents a multibillion-dollar annual revenue run rate.' Sign in to access your portfolio

iOS 26 is here — how to download the developer beta
iOS 26 is here — how to download the developer beta

Tom's Guide

time15 minutes ago

  • Tom's Guide

iOS 26 is here — how to download the developer beta

Apple's not wasting any time getting the newly unveiled iOS 26 on to people's phones. Immediately after previewing the upcoming software update today (June 9) at WWDC 2025, Apple released an iOS 26 developer beta. And you don't even have to be a developer to download it. iOS 26 introduces a new Liquid Glass design to the iPhone, the same interface overhaul coming to Apple's other software platforms this year. But there are other changes to familiar apps, including the Phone, Messages, Camera and Maps app, among others. In addition, iOS 26 will see the launch of an Apple Games app for managing your mobile gaming in one location. If you can't wait to see iOS 26 in action for yourself, we'll walk you through the steps of downloading the developer beta. That said, as we'll discuss below, you may want to wait for the iOS 26 public beta, which is set to arrive in July. The full iOS 26 release will come to iPhones in the fall, likely around the same time that the upcoming iPhone 17 lineup debuts. Here's what you need to know about the iOS 26 developer beta. While Apple intends its developer betas to be used by developers for updating their own software to run on the new operating system, a change in policy a few years ago means that anyone with an Apple ID can download developer betas like the one for iOS 26. You'll also have to enroll in Apple's developer program, which has a free tier. To enroll, go to the enrollment page on Apple's developer website and select "Start Your Enrollment." You'll be prompted to sign in with an Apple ID — use the same Apple ID associated with the iPhone where you plan to install the beta. You'll also have to be enrolled in Apple's beta program. Go to the Apple beta program website on your iPhone and select Sign Up. On the next page select Enroll Your iOS device then tap Open Beta Updates. Get instant access to breaking news, the hottest reviews, great deals and helpful tips. You'll jump to a screen in the Settings app where you'll be prompted to turn on beta updates. This makes it possible for Apple's software betas to be available for download on your phone. Before you install the iOS 26 developer beta, make sure you have a phone that supports the new software. The same iPhones that use the current iOS 18 also support iOS 26 — with three notable exceptions. The iPhone XR, iPhone XS and iPhone XS Max may run iOS 18 just fine, but they won't be able to run iOS 26. That's understandable as the phones did first ship in 2018, and that's at the outer edge of Apple's support window. Even though the iOS 26 developer beta will run on the iPhone 11 or later, you'll still need a phone with at least an A17 Bionic chipset to take advantage of any Apple Intelligence features in the new software. That means you need an iPhone 15 Pro, iPhone 15 Pro Max or any iPhone 16 including the iPhone 16e. Once you've enrolled in the developer program and backed up your iPhone, it's time to get that developer beta on your device. In the Settings app on the phone you wish to update, select General and then on the ensuing screen, tap Software Update. On the Software Update screen, select Beta Updates. From the list on the next screen, select iOS 26 Developer Beta; then, hit the Back button. The iOS 26 Beta will now appear as a downloadable option. Tap Update Now and follow the onscreen instructions. To paraphrase Jeff Goldblum from the first "Jurassic Park" movie, don't be so preoccupied that you can download the iOS 26 developer beta to not consider whether you should download it. Beta software is exactly that — it's unfinished and unproven. There could be bugs in it, and some of the apps you rely on regularly may not function properly, especially in early betas. For that reason, I always advise people to not install beta software on any device they depend on for their daily use. Instead, use a backup iPhone if you have one lying around. And if you don't, maybe consider waiting a month until the public beta arrives, as that version of iOS 26 figures to be more stable.

The biggest changes coming to your iPhone with iOS 26
The biggest changes coming to your iPhone with iOS 26

The Verge

time15 minutes ago

  • The Verge

The biggest changes coming to your iPhone with iOS 26

Apple just announced the next major software update for iPhones: iOS 26 (a jump from what, until recently, was expected to be called iOS 19), and it's packed with a whole bunch of new features for your phone. The biggest change is a new design, but there are lots of smaller improvements and additions as well that could make a difference in how you use your iPhone every day. Here's a bit more detail on what you can expect from iOS 26 when it releases for everyone this fall. If you want to try it early, Apple has already launched a developer beta, and it will offer a public beta sometime in July. A refreshed design across the OS Apple has a new design language called 'Liquid Glass' that it's being introduced across all of the company's devices, not just the iPhone. It's inspired by the visionOS software used with Apple's Vision Pro headset, and it features a lot of translucency that Apple says 'behaves like glass in the real world.' On-screen elements now use 'real-time rendering' that lets them react to movement with highlights and color shifts. It seems like changes from Liquid Glass will touch just about every part of the operating system, including apps, buttons, sliders, the Control Center, and your homescreen. Tab bars will also change because of Liquid Glass, shrinking and expanding as you scroll up and down. Messages is getting better for group chats In Messages, iOS 26 is adding a lot of updates that could significantly improve group chats. You'll be able to customize the background of a chat to give it more personality. To help make group decisions or get an opinion on something, you can create polls. And, at long last, Apple is adding typing indicators to group chats, which should make them feel more lively. Apple is taking some cues from Google by adding a call screening feature and a 'Hold Assist' feature that can wait on a call for you. The company is also adding a new unified layout option that combines Favorites, Recents, and Voicemails all into one view. Live Translation can translate calls in real time Apple is adding an Apple Intelligence-powered feature that can translate text on your screen and translate speech back and forth in the middle of a phone call. The company is building the feature into Messages, FaceTime, and the Phone app, and Apple says its models for Live Translation run entirely on your device for privacy. Some small Apple Intelligence improvements Even though we're still waiting for Apple to announce when it will actually release its delayed improvements to Siri, iOS 26 will include some new Apple Intelligence-powered features. Visual Intelligence will let you do searches about and take action on things you see on your screen. With Genmoji, you'll be able to combine two emoji into one. And Shortcuts will be able to use Apple Intelligence models to improve your workflows. The new Games app is for everything about your games Apple's new Games app will provide a centralized hub for everything about your games on your iPhone. The Home tab shows things like updates and events in your games. The Apple Arcade tab lets you browse the company's catalog of games on the service. The Library tab shows all of the App Store games you've ever downloaded. And the Play Together tab lets you see what your friends are up to.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store