logo
Google just fired the first shot of the next battle in the AI war

Google just fired the first shot of the next battle in the AI war

A new research paper, co-authored by Google's David Silver, just proposed a radical new AI era.
"The Era of Experience" tackles training data scarcity by having AI agents generate their own data.
This may be a Google dis of OpenAI and the current approach of using human data to train AI models.
There are so many AI research papers these days, it's hard to stand out. But one paper has fired up a lot of discussion across the tech industry in recent days.
"This is the most inspiring thing I've read in AI in the last two years," startup founder Suhail Doshi wrote on X this weekend. Jack Clark, cofounder of Anthropic, featured the paper in Monday's edition of his Import AI newsletter, which is closely read by thousands of industry researchers.
Written by Google researcher David Silver and Canadian computer scientist Rich Sutton, the paper boldly announces a new AI era.
The authors identify two previous modern AI eras. The first was epitomized by AlphaGo, a Google AI model that famously learned to play the board game "Go" better than humans in 2015. The second is the one we're in right now, defined by OpenAI's ChatGPT.
Silver and Sutton say we're now entering a new period called "the Era of Experience."
For me, this represents a new attempt by Google to tackle one of AI's most persistent problems — the scarcity of training data — while moving beyond a technological approach that OpenAI basically won.
The Simulation Era
Let's start with the first era, which, according to the authors, was the "Simulation Era."
In this period, roughly the mid-2010s, researchers used digital simulations to get AI models to play games repeatedly to learn how to perform like humans. We're talking millions and millions of games, such as chess, poker, Atari, and "Gran Turismo," played over and over, with rewards dangled for good results — thus teaching the machines what's good versus bad and incentivizing them to pursue better strategies.
This method of reinforcement learning, or RL, produced Google's AlphaGo. And it also helped to create another Google model called AlphaZero, which discovered new strategies for chess and "Go," and changed the way that humans play these games.
The problem with this approach: Machines trained this way did well on specific problems with precisely defined rewards, but couldn't tackle more general, open-ended problems with vague payoffs, according to the authors. So, probably not really full AI.
The Human Data Era
The next area was kicked off by another Google research paper published in 2017. " Attention is All You Need" proposed that AI models should be trained on mountains of human-created data from the internet. Just by allowing machines to pay "attention" to all this information, they would learn to behave like humans and perform as well as us on a wide variety of different tasks.
This is the era we're in now, and it has produced ChatGPT and most of the other powerful generative AI models and tools that are increasingly being used to automate tasks such as graphic design, content creation, and software coding.
The key to this era has been amassing as much high-quality, human-generated data as possible, and using that in massive, compute-intensive training runs to imbue AI models with an understanding of the world.
While Google researchers kicked off this era of human data, most of these people left the company and started their own things. Many went to OpenAI and worked on technology that ultimate produced ChatGPT, which is by far the most successful generative AI product in history. Others went on to start Anthropic, another leading generative AI startup that runs Claude, a powerful chatbot and AI agent.
A Google dis?
Many experts in the AI industry, and some investors and analysts on Wall Street, think that Google may have dropped the ball here. It came up with this AI approach, but OpenAI and ChatGPT have run away with most of the spoils so far.
I think the jury is still out. However, you can't help but think about this situation when the authors seem to be dissing the era of human data.
"It could be argued that the shift in paradigm has thrown out the baby with the bathwater," they wrote. "While human-centric RL has enabled an unprecedented breadth of behaviours, it has also imposed a new ceiling on the agent's performance: agents cannot go beyond existing human knowledge."
Silver and Sutton are right about one aspect of this. The supply of high-quality human data has been outstripped by the insatiable demand from AI labs and Big Tech companies that need fresh content to train new models and move their abilities forward. As I wrote last year, it has become a lot harder and more expensive to make big leaps at the AI frontier.
The Era of Experience
The authors have a pretty radical solution for this, and it's at the heart of the new Era of Experience that they propose in this paper.
They suggest that models and agents should just get out there and create their own new data through interactions with the real world.
This will solve the nagging data-supply problem, they argue, while helping the field attain AGI, or artificial general intelligence, a technical holy grail where machines outperform humans in most useful activities.
"Ultimately, experiential data will eclipse the scale and quality of human-generated data," Silver and Sutton write. "This paradigm shift, accompanied by algorithmic advancements in RL, will unlock in many domains new capabilities that surpass those possessed by any human."
Any modern parent can think of this as the equivalent of telling their child to get off the couch, stop looking at their phone, and go be outside and play with their friends. There are a lot richer, satisfying, and more valuable experiences out there to learn from.
Clark, the Anthropic cofounder, was impressed by the chutzpah of this proposal.
"Papers like this are emblematic of the confidence found in the AI industry," he wrote in his newsletter on Monday, citing "the gumption to give these agents sufficient independence and latitude that they can interact with the world and generate their own data."
Examples, and a possible final dis
The authors float some theoretical examples of how this might work in the new Era of Experience.
An AI health assistant could ground a person's health goals into a reward based on a combination of signals such as their resting heart rate, sleep duration, and activity levels. (A reward in AI is a common way to incentivize models and agents to perform better. Just like you might nag your partner to exercise more by saying they'll get stronger and look better if they go to the gym.)
An educational assistant could use exam results to provide an incentive or reward, based on a grounded reward for a user's language learning.
A science agent with a goal to reduce global warming might use a reward based on empirical observations of carbon dioxide levels, Silver and Sutton suggest.
In a way, this is a return to the previous Era of Simulation, which Google arguably led. Except this time, AI models and agents are learning from the real world and collecting their own data, rather than existing in a video game or other digital realm.
The key is that, unlike the Era of Human Data, there may be no limit to the information that can be generated and gathered for this new phase of AI development.
In our current human data period, something was lost, the authors argue: an agent's ability to self-discover its own knowledge.
"Without this grounding, an agent, no matter how sophisticated, will become an echo chamber of existing human knowledge," Silver and Sutton wrote, in a possible final dis to OpenAI.

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Big tech on a quest for ideal AI device
Big tech on a quest for ideal AI device

Yahoo

timean hour ago

  • Yahoo

Big tech on a quest for ideal AI device

ChatGPT-maker OpenAI has enlisted the legendary designer behind the iPhone to create an irresistible gadget for using generative artificial intelligence (AI). The ability to engage digital assistants as easily as speaking with friends is being built into eyewear, speakers, computers and smartphones, but some argue that the Age of AI calls for a transformational new gizmo. "The products that we're using to deliver and connect us to unimaginable technology are decades old," former Apple chief design officer Jony Ive said when his alliance with OpenAI was announced. "It's just common sense to at least think, surely there's something beyond these legacy products." Sharing no details, OpenAI chief executive Sam Altman said that a prototype Ive shared with him "is the coolest piece of technology that the world will have ever seen." According to several US media outlets, the device won't have a screen, nor will it be worn like a watch or broach. Kyle Li, a professor at The New School, said that since AI is not yet integrated into people's lives, there is room for a new product tailored to its use. The type of device won't be as important as whether the AI innovators like OpenAI make "pro-human" choices when building the software that will power them, said Rob Howard of consulting firm Innovating with AI - Learning from flops - The industry is well aware of the spectacular failure of the AI Pin, a square gadget worn like a badge packed with AI features but gone from the market less than a year after its debut in 2024 due to a dearth of buyers. The AI Pin marketed by startup Humane to incredible buzz was priced at $699. Now, Meta and OpenAI are making "big bets" on AI-infused hardware, according to CCS Insight analyst Ben Wood. OpenAI made a multi-billion-dollar deal to bring Ive's startup into the fold. Google announced early this year it is working on mixed-reality glasses with AI smarts, while Amazon continues to ramp up Alexa digital assistant capabilities in its Echo speakers and displays. Apple is being cautious embracing generative AI, slowly integrating it into iPhones even as rivals race ahead with the technology. Plans to soup up its Siri chatbot with generative AI have been indefinitely delayed. The quest for creating an AI interface that people love "is something Apple should have jumped on a long time ago," said Futurum research director Olivier Blanchard. - Time to talk - Blanchard envisions some kind of hub that lets users tap into AI, most likely by speaking to it and without being connected to the internet. "You can't push it all out in the cloud," Blanchard said, citing concerns about reliability, security, cost, and harm to the environment due to energy demand. "There is not enough energy in the world to do this, so we need to find local solutions," he added. Howard expects a fierce battle over what will be the must-have personal device for AI, since the number of things someone is willing to wear is limited and "people can feel overwhelmed." A new piece of hardware devoted to AI isn't the obvious solution, but OpenAI has the funding and the talent to deliver, according to Julien Codorniou, a partner at venture capital firm 20VC and a former Facebook executive. OpenAI recently hired former Facebook executive and Instacart chief Fidji Simo as head of applications, and her job will be to help answer the hardware question. Voice is expected by many to be a primary way people command AI. Google chief Sundar Pichai has long expressed a vision of "ambient computing" in which technology blends invisibly into the world, waiting to be called upon. "There's no longer any reason to type or touch if you can speak instead," Blanchard said. "Generative AI wants to be increasingly human" so spoken dialogues with the technology "make sense," he added. However, smartphones are too embedded in people's lives to be snubbed any time soon, said Wood. tu-gc/arp/nl Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

3 Magnificent S&P 500 Dividend Stocks Down 15% to 65% to Buy and Hold Forever
3 Magnificent S&P 500 Dividend Stocks Down 15% to 65% to Buy and Hold Forever

Yahoo

time2 hours ago

  • Yahoo

3 Magnificent S&P 500 Dividend Stocks Down 15% to 65% to Buy and Hold Forever

Alphabet is the cheapest Magnificent Seven stock and is vastly underrated. Semiconductor equipment supplier Applied Materials should deliver AI-powered double-digit dividend growth. The beaten-down retailer Target pays almost 5% and has been around since 1902. 10 stocks we like better than Alphabet › When a great company runs into a short-term problem or just mere skepticism, it can make for an excellent opportunity for the long-term investor. And if such a company pays a rising dividend that can grow over time, that's a big future passive income opportunity. Currently, skepticism abounds for the following three S&P 500 dividend stocks. But scooping up shares today could pay big dividends -- pun intended -- over the long run. The "Magnificent Seven" stocks are generally some of the strongest, most resilient, and innovative companies around, and Alphabet (NASDAQ: GOOG) (NASDAQ: GOOGL) is by far the cheapest of the bunch. Shares trade at just around 20 times earnings, not only a discount to peers but even the overall market. And while its dividend is only 0.5%, Alphabet's payout ratio is just 8.9%, leaving huge room for growth. Instead of a bigger dividend, Alphabet is returning cash to shareholders via share repurchases, while also investing in future AI growth. Of course, the reason Alphabet's stock has been under pressure is due to concerns over its main cash cow, Google Search, and the threat posed by AI chatbots. While that is something to monitor, Alphabet has been innovating in AI rapidly, delivering "AI Overviews" when users search a topic, which management claims monetizes at the same rate as Search. The company also unveiled "AI Mode" in Google Search on May 20, offering the experience of a chatbot within the Search ecosystem. Furthermore, Alphabet has been rapidly catching up to AI chatbot leader OpenAI in large language models (LLMs). When Alphabet's Gemini 2.5 LLM was released on March 25, it immediately rocketed up the developer rankings among top-performing LLMs, seizing the current lead for many applications. Given Alphabet's technical talent and financial resources, I'd expect the company to muddle through this transition. But even if Search growth slows down, there are other AI-related growth opportunities related to Gemini-powered chatbots and agents. Meanwhile, Alphabet has three other large and growing businesses the market appears to be ignoring: YouTube, Google Cloud, and Waymo. YouTube is the the largest streaming company in the world, and growing by double-digits. Google Cloud, while the third-place infrastructure platform, still grew 28% last quarter to a $50 billion annual revenue run-rate, and has been profitable since the first quarter of 2023. And then there's Waymo, which has taken a leading pole position in the autonomous taxi industry. While Tesla is ramping up its rival service this month, Waymo has a five-year head start, has doubled its rides over the past five months, and is now doing over 250,000 autonomous rides a week across four cities. Despite a recent bounce, semiconductor equipment supplier Applied Materials (NASDAQ: AMAT) is still 33% below its July 2024 highs. However, Applied is one of the highest-quality businesses you'll find in tech. It's a leader in etch and deposition semiconductor equipment, with additional franchises in metrology and ion implant machines. These machines play key parts in the production of leading-edge semiconductors and memory needed for AI. Applied also has a highly profitable services business attached to that growing installed base of equipment, with recurring-like services making up 22% of the company's revenue last quarter. As one of just a few companies that make these extremely advanced machines needed to make today's leading-edge semiconductors, Applied is set to win big from the AI revolution. And its deep technology moat enables high margins and returns on capital. Applied currently pays a 1.1% dividend, but like Alphabet, it also pays most of its shareholder returns out in the form of share repurchases, with a low dividend payout ratio of just 19.5%. That leaves a lot of room to grow that dividend; in fact, Applied just raised its dividend by 15% this year, and I'd expect more double-digit increases into the future. Although its business isn't quite as robust and doesn't have the technology growth prospects of Alphabet or Applied Materials, retailer Target (NYSE: TGT) is much cheaper, trading at just 11 times earnings, with a hefty 4.6% dividend. The stock is also down a whopping 64% from its all-time highs. Yes, Target is seeing some declines in revenue this year, but the company is still going to be profitable. Unlike rivals Walmart and Costco, Target isn't known for ultra-low prices. But Target stores are still competitive, and are usually in more convenient locations. Although it stocks a lot of everyday items, Target also tilts more toward discretionary purchase items such as apparel. With the high inflationary period of the last few years and overhang of tariffs, consumers have tightened their their belts and are making fewer discretionary purchases, hurting Target's market share. However, the inflation of the past few years now seems to finally be ebbing. If it does, a rebound could be in the cards. Meanwhile, Target management has pointed to some green shoots in the business. For instance, the digital business grew in the mid-single digits last quarter, with 36% growth in same-day delivery. And Target said it has improved its mitigation of "shrink," or theft, which had increased since the pandemic. Target has been around since 1902, and CEO Brian Cornell has steered the company through several crises before. Investors should expect Target to continue to adapt and recover. If the business merely stabilizes, the stock could still do well going forward, in light of its beaten-down valuation. Before you buy stock in Alphabet, consider this: The Motley Fool Stock Advisor analyst team just identified what they believe are the for investors to buy now… and Alphabet wasn't one of them. The 10 stocks that made the cut could produce monster returns in the coming years. Consider when Netflix made this list on December 17, 2004... if you invested $1,000 at the time of our recommendation, you'd have $655,255!* Or when Nvidia made this list on April 15, 2005... if you invested $1,000 at the time of our recommendation, you'd have $888,780!* Now, it's worth noting Stock Advisor's total average return is 999% — a market-crushing outperformance compared to 174% for the S&P 500. Don't miss out on the latest top 10 list, available when you join . See the 10 stocks » *Stock Advisor returns as of June 9, 2025 Suzanne Frey, an executive at Alphabet, is a member of The Motley Fool's board of directors. Billy Duberstein and/or hos clients have positions in Alphabet, Applied Materials, and Costco Wholesale. The Motley Fool has positions in and recommends Alphabet, Applied Materials, Costco Wholesale, Target, Tesla, and Walmart. The Motley Fool has a disclosure policy. 3 Magnificent S&P 500 Dividend Stocks Down 15% to 65% to Buy and Hold Forever was originally published by The Motley Fool Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

Policy / Politics Tanks, guns and face-painting The uncanny festivities of the US Army's 250th anniversary, celebrated on Donald Trump's birthday. by Tina Nguyen Jun 14, 2025, 9:36 PM EDT Link Facebook Threads 0 Comments / 0 New AFP via Getty Images
Policy / Politics Tanks, guns and face-painting The uncanny festivities of the US Army's 250th anniversary, celebrated on Donald Trump's birthday. by Tina Nguyen Jun 14, 2025, 9:36 PM EDT Link Facebook Threads 0 Comments / 0 New AFP via Getty Images

The Verge

time2 hours ago

  • The Verge

Policy / Politics Tanks, guns and face-painting The uncanny festivities of the US Army's 250th anniversary, celebrated on Donald Trump's birthday. by Tina Nguyen Jun 14, 2025, 9:36 PM EDT Link Facebook Threads 0 Comments / 0 New AFP via Getty Images

In what's become a bit of a Decoder tradition, I spoke with Google CEO Sundar Pichai in person after I/O. The conference this year was all about AI, particularly a slew of actual AI products, not just models and capabilities. To Sundar, this marks the beginning of a new era for search and the web overall. So I had to ask: what happens to the web when AI tools and eventually agents do most of the browsing for us? It was a very Decoder conversation.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store