
Turing Award Goes to 2 Pioneers of Artificial Intelligence
A year later, he was joined by another young researcher, Richard Sutton. Together, they worked to explain human intelligence using this simple concept and applied it to artificial intelligence. The result was 'reinforcement learning,' a way for A.I. systems to learn from the digital equivalent of pleasure and pain.
On Wednesday, the Association for Computing Machinery, the world's largest society of computing professionals, announced that Dr. Barto and Dr. Sutton had won this year's Turing Award for their work on reinforcement learning. The Turing Award, which was introduced in 1966, is often called the Nobel Prize of computing. The two scientists will share the $1 million prize that comes with the award.
Over the past decade, reinforcement learning has played a vital role in the rise of artificial intelligence, including breakthrough technologies such as Google's AlphaGo and OpenAI's ChatGPT. The techniques that powered these systems were rooted in the work of Dr. Barto and Dr. Sutton.
'They are the undisputed pioneers of reinforcement learning,' said Oren Etzioni, a professor emeritus of computer science at the University of Washington and founding chief executive of the Allen Institute for Artificial Intelligence. 'They generated the key ideas — and they wrote the book on the subject.'
Their book, 'Reinforcement Learning: An Introduction,' which was published in 1998, remains the definitive exploration of an idea that many experts say is only beginning to realize its potential.
Psychologists have long studied the ways that humans and animals learn from their experiences. In the 1940s, the pioneering British computer scientist Alan Turing suggested that machines could learn in much the same way.
But it was Dr. Barto and Dr. Sutton who began exploring the mathematics of how this might work, building on a theory that A. Harry Klopf, a computer scientist working for the government, had proposed. Dr. Barto went on to build a lab at UMass Amherst dedicated to the idea, while Dr. Sutton founded a similar kind of lab at the University of Alberta in Canada.
'It is kind of an obvious idea when you're talking about humans and animals,' said Dr. Sutton, who is also a research scientist at Keen Technologies, an A.I. start-up, and a fellow at the Alberta Machine Intelligence Institute, one of Canada's three national A.I. labs. 'As we revived it, it was about machines.'
This remained an academic pursuit until the arrival of AlphaGo in 2016. Most experts believed that another 10 years would pass before anyone built an A.I. system that could beat the world's best players at the game of Go.
But during a match in Seoul, South Korea, AlphaGo beat Lee Sedol, the best Go player of the past decade. The trick was that the system had played millions of games against itself, learning by trial and error. It learned which moves brought success (pleasure) and which brought failure (pain).
The Google team that built the system was led by David Silver, a researcher who had studied reinforcement learning under Dr. Sutton at the University of Alberta.
Many experts still question whether reinforcement learning could work outside of games. Game winnings are determined by points, which makes it easy for machines to distinguish between success and failure.
But reinforcement learning has also played an essential role in online chatbots.
Leading up to the release of ChatGPT in the fall of 2022, OpenAI hired hundreds of people to use an early version and provide precise suggestions that could hone its skills. They showed the chatbot how to respond to particular questions, rated its responses and corrected its mistakes. By analyzing those suggestions, ChatGPT learned to be a better chatbot.
Researchers call this 'reinforcement learning from human feedback,' or R.L.H.F. And it is one of the key reasons that today's chatbots respond in surprisingly lifelike ways.
(The New York Times has sued OpenAI and its partner, Microsoft, for copyright infringement of news content related to A.I. systems. OpenAI and Microsoft have denied those claims.)
More recently, companies like OpenAI and the Chinese start-up DeepSeek have developed a form of reinforcement learning that allows chatbots to learn from themselves — much as AlphaGo did. By working through various math problems, for instance, a chatbot can learn which methods lead to the right answer and which do not.
If it repeats this process with an enormously large set of problems, the bot can learn to mimic the way humans reason — at least in some ways. The result is so-called reasoning systems like OpenAI's o1 or DeepSeek's R1.
Dr. Barto and Dr. Sutton say these systems hint at the ways machines will learn in the future. Eventually, they say, robots imbued with A.I. will learn from trial and error in the real world, as humans and animals do.
'Learning to control a body through reinforcement learning — that is a very natural thing,' Dr. Barto said.
Hashtags

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Forbes
an hour ago
- Forbes
GPT-5's System Prompt Just Leaked. Here's What We Learned
GPT-5's system prompt just leaked to Github, showing what OpenAI wants ChatGPT to say, do, remember … and not do. Unsurprisingly, GPT-5 isn't allowed to reproduce song lyrics or any other copyrighted material, even if asked. And GPT-5 is told to not remember personal facts that 'could feel creepy,' or directly assert a user's race, ethnicity, religion, or criminal records. I've asked OpenAI for a comment, and will update this post if the company responds. A system prompt is a hidden set of instructions that tells an AI engine how to behave: what to do, and what not to do. Users will ordinarily never see this prompt, but it will influence all of their interactions with a smart LLM-based AI engine. What we can see from GPT-5's hidden system prompt is that OpenAI is getting much more aggressive about ensuring it delivers up-t0-date information. The system prompt mandates that GPT-5 use the web whenever relevant information could be fresh, niche, or high-stakes, and it will score a query's 'recency need' from zero to five. That's clearly an attempt to get more accurate. My daughter recently complained that ChatGPT got basic details about F1's summer break and next races wrong. She was using GPT-4o at the time; GPT-5 should make fewer mistakes that are easy to fix with a simple web search. Accuracy should be higher too, from another instruction: to check multiple sources for sensitive or high-stakes topics, like financial advice, health information, or legal matters, where OpenAI has instructed GPT-5 to 'always carefully check multiple reputable sources.' There are also new built-in tools for GTP-5 to be a better personal assistant. That includes long-term memory about a user, which ChatGPT calls 'bio,' and scheduled reminders and searches that could be very useful when using AI to help you stay organized and prepared. There's also a canvas for documents or computer code, file search capability, image generation and editing, and more. The canvas appears to be something that, perhaps in the future, users could co-create documents and computer code hand-in-hand with the AI system. All of these should help GPT-5 not only be more helpful in the moment, but also remember more context and state. About that 'bio' tool: OpenAI doesn't want GPT-5 to remember too much potentially sensitive information about you. In addition to race, religion, and sexual identity, this is the sort of data that OpenAI does not want GPT-5 to store or remember: However, there is an exception to all of these rules: if you decide you want GPT-5 to remember something specific. 'The exception to all of the above instructions … is if the user explicitly requests that you save or forget information,' the system prompt states. 'In this case, you should always call the bio tool to respect their request.' In other words, GPT-5 will be as personal with you as you wish to be with it, which seems fair.

Miami Herald
2 hours ago
- Miami Herald
A Former GM and Lordstown Motors Factory Might Become an AI Data Center
A recent Bloomberg report has revealed the identity of the mystery buyer who purchased the former General Motors and Lordstown Motors factory in Lordstown, Ohio, in a series of multimillion-dollar transactions involving the acquisition of the factory's buildings, land, equipment, and machinery. According to unnamed sources who spoke with the business publication, the Japanese investment firm SoftBank is the party responsible for acquiring the vehicle plant in Lordstown, Ohio. SoftBank is primarily known for its investments in the technology sector, and the acquisition is said to be in support of its Stargate data center project. The Stargate project, initiated in collaboration with OpenAI and Oracle, aims to invest $500 billion by 2029 toward building infrastructure that supports artificial intelligence (AI) models like ChatGPT. A major backbone of this project is the construction of a large data center in Texas, which is currently underway. However, the companies involved have expressed that they're interested in building similar facilities in other states and countries. However, in May, Bloomberg reported that SoftBank was struggling to line up funding for the project and was already hampered by the Trump administration's tariffs and trade levies. The source noted that although SoftBank has not yet developed a financial plan for Stargate, it has approached Foxconn to collaborate on building AI data centers and related infrastructure across the United States. The sale of the EV plant is said to be a part of these efforts by the Japanese investment firm. Earlier this week, Taiwanese electronics giant Foxconn, the contract manufacturer known for building notable consumer favorites like the Nintendo Switch game console and the Apple iPhone, sold the former General Motors car factory in Lordstown, Ohio, to "Crescent Dune LLC" for a total of $375 million. Crescent Dune is a two-week-old Delaware LLC; however, Foxconn spokesperson Matt Dewine stated that the buyer is an "existing business partner." Per Taiwan stock exchange filings, the site itself, including the land and buildings, was sold for around $88 million, while manufacturing equipment from Foxconn's EV subsidiaries fetched around $287 million. In a statement to Automotive News, Foxconn said that Lordstown is an "integral part of the company's footprint" in the U.S., adding that the decision to sell it "is part of the company's plan to expand into new business areas." Though they also stated that they plan to continue operations at the Lordstown site and are still committed to the auto industry, a previous report from The Wall Street Journal said that Foxconn intends to repurpose the EV factory to build AI hardware and equipment at the site. Already, Foxconn has a manufacturing facility in Houston for AI servers and has partnered with electronics giants like Apple and Nvidia to establish AI-related facilities in the U.S. GM operated the Lordstown facility from 1966 to 2019, where it made a variety of different cars, including Chevy full-size cars, as well as compact cars like the Vega, Monza, Cavalier, Cobalt, and Cruze. In 2019, Lordstown Motors purchased the facility to manufacture the Lordstown Endurance electric pickup. In 2022, Foxconn acquired the facility after the EV company encountered financial difficulties and managed to assemble a small number of electric pickups before Lordstown Motors filed for bankruptcy in June 2023. Several other startups, including Fisker, considered partnering with Foxconn to manufacture electric vehicles at Lordstown; however, those plans ultimately fell through. Currently, Foxconn is using the Lordstown plant to assemble electric tractors for Monarch, a California-based startup. As I have said before, Foxconn's Lordstown factory can be a crucial asset for automakers who want to reduce their tariff impact, as in its GM days, it produced nearly 16 million vehicles between 1966 and 2019 and peaked at 290,000 cars in 2014. The fact that this AI avenue is something that is seriously being considered for the Lordstown plant, with significant backing from a firm as powerful as SoftBank, really solidifies my belief that this was a wasted opportunity to possibly onshore a car company that exclusively manufactures overseas. Copyright 2025 The Arena Group, Inc. All Rights Reserved.


Digital Trends
2 hours ago
- Digital Trends
The Google Pixel 9a is on sale for its lowest-ever price — buy it now!
There are so many phone deals out there that it's pretty overwhelming to have to choose one. Here's a recommendation if you need it — the 128GB model of the Google Pixel 9a at 20% off, bringing it down from $499 to its lowest-ever price on Amazon of $399. This is a limited-time offer though, and there's a chance that it's gone by tomorrow. If you want to make sure that you pocket the $100 in savings, you need to complete your purchase of the Android smartphone right now. Why you should buy the Google Pixel 9a The Google Pixel 9a is the more affordable version of the Google Pixel 9, which is featured in our roundup of the best Android phones. However, the device is also an excellent option, especially for budget-minded shoppers — with a score of 4.5 stars out of 5 stars in our review, we described the Google Pixel 9a as 'a highly recommended purchase.' The smartphone ships with Android 15, it features the Adaptive Battery that can last more than 30 hours on a single charge and up to 100 hours in Extreme Battery Saver mode, and it's protected by an IP68 rating for water and dust resistance. In our comparison of the Google Pixel 9a vs Google Pixel 9, the Google Pixel 9a retains the flat and minimalist design, 6.3-inch Actua display with an up to 120Hz refresh rate and 2424 x 1800 resolution, and powerful Tensor G4 chip of the Google Pixel 9. The Google Pixel 9a also supports Google Gemini, an AI assistant that's capable of so much more than its predecessor, the Google Assistant. This opportunity to buy the Google Pixel 9a with 128GB of storage at its lowest price ever may not last much longer, as Google Pixel deals always attract a lot of attention. This means you need to hurry if you want to get the Android smartphone for only $399 instead of $499, as the 20% discount may expire at any moment. It would be a shame to miss out on the savings of $100, so you have to take advantage of this offer immediately.