How Artificial Intelligence Reasons

26-03-2025

In September, OpenAI unveiled a new version of ChatGPT designed to reason through tasks involving math, science and computer programming. Unlike previous versions of the chatbot, this new technology could spend time 'thinking' through complex problems before settling on an answer.
Soon, the company said its new reasoning technology had outperformed the industry's leading systems on a series of tests that track the progress of artificial intelligence.
Now other companies, like Google, Anthropic and China's DeepSeek, offer similar technologies.
But can A.I. actually reason like a human? What does it mean for a computer to think? Are these systems really approaching true intelligence?
Here is a guide.
What does it mean when an A.I. system reasons?
Reasoning just means that the chatbot spends some additional time working on a problem.
'Reasoning is when the system does extra work after the question is asked,' said Dan Klein, a professor of computer science at the University of California, Berkeley, and chief technology officer of Scaled Cognition, an A.I. start-up.
It may break a problem into individual steps or try to solve it through trial and error.
The original ChatGPT answered questions immediately. The new reasoning systems can work through a problem for several seconds — or even minutes — before answering.
Can you be more specific?
In some cases, a reasoning system will refine its approach to a question, repeatedly trying to improve the method it has chosen. Other times, it may try several different ways of approaching a problem before settling on one of them. Or it may go back and check some work it did a few seconds before, just to see if it was correct.
Basically, the system tries whatever it can to answer your question.
This is kind of like a grade school student who is struggling to find a way to solve a math problem and scribbles several different options on a sheet of paper.
What sort of questions require an A.I. system to reason?
It can potentially reason about anything. But reasoning is most effective when you ask questions involving math, science and computer programming.
How is a reasoning chatbot different from earlier chatbots?
You could ask earlier chatbots to show you how they had reached a particular answer or to check their own work. Because the original ChatGPT had learned from text on the internet, where people showed how they had gotten to an answer or checked their own work, it could do this kind of self-reflection, too.
But a reasoning system goes further. It can do these kinds of things without being asked. And it can do them in more extensive and complex ways.
Companies call it a reasoning system because it feels as if it operates more like a person thinking through a hard problem.
Why is A.I. reasoning important now?
Companies like OpenAI believe this is the best way to improve their chatbots.
For years, these companies relied on a simple concept: The more internet data they pumped into their chatbots, the better those systems performed.
But in 2024, they used up almost all of the text on the internet.
That meant they needed a new way of improving their chatbots. So they started building reasoning systems.
How do you build a reasoning system?
Last year, companies like OpenAI began to lean heavily on a technique called reinforcement learning.
Through this process — which can extend over months — an A.I. system can learn behavior through extensive trial and error. By working through thousands of math problems, for instance, it can learn which methods lead to the right answer and which do not.
Researchers have designed complex feedback mechanisms that show the system when it has done something right and when it has done something wrong.
'It is a little like training a dog,' said Jerry Tworek, an OpenAI researcher. 'If the system does well, you give it a cookie. If it doesn't do well, you say, 'Bad dog.''
(The New York Times sued OpenAI and its partner, Microsoft, in December for copyright infringement of news content related to A.I. systems.)
Does reinforcement learning work?
It works pretty well in certain areas, like math, science and computer programming. These are areas where companies can clearly define the good behavior and the bad. Math problems have definitive answers.
Reinforcement learning doesn't work as well in areas like creative writing, philosophy and ethics, where the distinction between good and bad is harder to pin down. Researchers say this process can generally improve an A.I. system's performance, even when it answers questions outside math and science.
'It gradually learns what patterns of reasoning lead it in the right direction and which don't,' said Jared Kaplan, chief science officer at Anthropic.
Are reinforcement learning and reasoning systems the same thing?
No. Reinforcement learning is the method that companies use to build reasoning systems. It is the training stage that ultimately allows chatbots to reason.
Do these reasoning systems still make mistakes?
Absolutely. Everything a chatbot does is based on probabilities. It chooses a path that is most like the data it learned from — whether that data came from the internet or was generated through reinforcement learning. Sometimes it chooses an option that is wrong or does not make sense.
Is this a path to a machine that matches human intelligence?
A.I. experts are split on this question. These methods are still relatively new, and researchers are still trying to understand their limits. In the A.I. field, new methods often progress very quickly at first, before slowing down.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

ChatGPT-5 is delayed — but these 5 features could make the wait worth it

Yahoo

an hour ago

Yahoo

ChatGPT-5 is delayed — but these 5 features could make the wait worth it

When you buy through links on our articles, Future and its syndication partners may earn a commission. GPT-5 seems to keep getting further and further away. At one point, we were expecting it in the summer. Then it was July, and then the start of August. Now, a few days into August, still no GPT-5. Of course, this doesn't necessarily mean we've got long to wait. Reviewers are testing GPT-5 now, and OpenAI CEO Sam Altman has made repeated claims about his use of it and how strong its performance has been. In fact, all signs still suggest an August release. But, OpenAI has a lot going on right now, and it's no stranger to delays, often pushing products and models back months at a time. On August 2, Altman posted on X saying: 'We have a ton of stuff to launch over the next couple of months — new models, products, features, and more. Please bear with us through some probable hiccups and capacity crunches.' GPT-5 is likely to be the company's biggest launch ever, and therefore one the company wants to get right. But even if it is delayed, GPT-5 will be worth waiting for, launching with a bunch of cool new features. The five features we're most excited for in GPT-5 Sora 2 A recent leak suggested that, with GPT-5, we could finally see the launch of Sora 2. This would be the second iteration of OpenAI's video creation tool. The first Sora model had a fairly short life. It appeared, made headlines, and was quickly beaten by the competition. Since then, OpenAI seems to have put it on the back burner. Improved memory With any major update to an AI model, one of the first and most noticeable changes is in its memory. With GPT-5, the model (if you let it) is likely to better remember key details about you and past conversations. This could include your personality type, preferences, and opinions on key topics. A lot of people will have love-hate relationships with this kind of update. The likes of Claude and Le Chat have steered clear of giving their AI models too much memory. OpenAI on the other hand has lent into it. Better coding One of the big pushes right now in the world of AI models is for improved coding abilities. Chatbots are able to code entire apps, build databases and solve coding problems with surprising ease. However, they can keep getting better and GPT-5 will be a change for OpenAI to take the spot as the best coding chatbot. Agentic ChatGPT recently launched its agent tool, letting ChatGPT make actions on your behalf. This could be booking restaurant tables, finding the latest deals on laptops and buying one or simply checking your calendar. With GPT-5, we'll likely see improvements to this tool, letting you complete more complicated actions. More conversational Chatbots, while more natural-sounding than ever, continue to talk like robots. With GPT-5 and other competitors big changes, we're seeing more natural versions of chatbots pop up. With this latest version, we could see ChatGPT take on different voices and personalities for different situations, or simply speak in a more natural, conversational way when prompting. More from Tom's Guide just launched an AI-powered social feed — and it's like TikTok meets ChatGPT Sam Altman just teased GPT-5 with one question — and the answer says it all OpenAI just pulled a controversial ChatGPT feature — what you need to know

OpenAI launches two new AI models ahead of GPT-5 - here's everything you need to know

Yahoo

an hour ago

Yahoo

OpenAI launches two new AI models ahead of GPT-5 - here's everything you need to know

When you buy through links on our articles, Future and its syndication partners may earn a commission. OpenAI is once again doing side quests in the lead up to the launch of GPT-5. As we wait for the big update, OpenAI is pausing to bring us not one, but two entirely separate models to play with. Both of these new models are available to download for free to anyone with some coding ability via Hugging Face. They come in two sizes, with the larger option being the more capable gpt-oss-120b model that can run on just one single Nvidia GPU, and a second smaller model, called gpt-oss-20b. This one can run on a consumer laptop with 16GB of memory. This is the first time OpenAI has launched an open weight model in years, and has been delaying its release for a while now. While smaller AI companies like Le Chat, Deepseek, and Alibaba have frequently released open-weight models, OpenAI has tended to keep their doors closed off. Sam Altman, CEO of OpenAI, said at the start of the year that OpenAI felt it was on the wrong side of history for this, suggesting they would be going back to launching some open-source model What are open-weight models? Quite simply, an open-weight model is one where all of its training parameters are made publicly available. Developers can access these, analyzing and fine tuning them for their own projects. In such a competitive market, it seems strange for this to be a thing. And yet, it is a very popular option, with some of the most powerful models on the market being open-weighted. Of course, GPT-5 won't be, neither would the likes of Grok and Claude's top models. But that isn't to say that this new option from OpenAI isn't powerful. When put through tests, OpenAI's two new models both performed ahead of Deepseek's R1 and in a similar line to some of OpenAI's other reasoning models. In both models, the full chain of thought can be accessed, making for easier debugging of code and higher trust in the models. What does this mean for you? If you're a developer in the AI space, this will be big news for you. OpenAI took a long break from offering out its weights available to the public, and there is a clear shift in their thinking for this to become available. For everybody else, this won't be of much importance. The big update for the average person will be GPT-5 when that launches in the next week or so. OpenAI did promise a lot of big updates in the next few weeks, with this just being the starter for the main course soon to come. More from Tom's Guide ChatGPT-5 is coming — here's how it could change the way we prompt forever Amazon may bring ads to Alexa+ in least surprising move ever OpenAI says they are no longer optimizing ChatGPT to keep you chatting — here's why

OpenAI says they are no longer optimizing ChatGPT to keep you chatting — here's why

Yahoo

an hour ago

Yahoo

OpenAI says they are no longer optimizing ChatGPT to keep you chatting — here's why

When you buy through links on our articles, Future and its syndication partners may earn a commission. With over 180.5 million monthly active users and nearly 2.5 billion prompts per day, OpenAI recently revealed it is optimizing ChatGPT to help, not hook. In a new blog post titled 'What we're optimizing ChatGPT for,' OpenAI revealed it's moving away from traditional engagement metrics like time spent chatting. Instead, the company says it's now prioritizing user satisfaction, task completion and overall usefulness. This is an unconventional stance, as apps like TikTok, Meta, and similar Silicon Valley companies strive to keep users tied to their screens. 'We're not trying to maximize the time you spend with ChatGPT,' OpenAI wrote. 'We want you to use it when it's helpful, and not use it when it isn't.' Less stickiness, more usefulness While many platforms chase user attention, often with addictive features, OpenAI says it's focused on building a helpful assistant that respects your time. That means ChatGPT won't be trying to keep you talking just for the sake of it. Instead, it's being shaped into a tool that helps you solve problems, learn something new or complete a task, and then get on with your day. This approach mirrors recent updates like Study Mode and ChatGPT Agent, both of which are designed to get things done rather than entertain. Together, they reflect OpenAI's growing focus on goal-oriented AI over engagement-first design. AI that helps, not hooks Rather than acting like a social app that wants you to linger, OpenAI says ChatGPT is being tuned to behave more like a true assistant that offers answers, structure and support without dragging you into an endless chat spiral. Behind the scenes, OpenAI says it's incorporating feedback from its Superalignment and Preparedness teams, along with trust and safety evaluations, to ensure the assistant is more transparent, less sycophantic and better at knowing when to be concise. OpenAI also acknowledges that people use ChatGPT in different ways; some want speed, others want depth; some prefer playful conversation, others want straight answers. The goal is to improve default settings while still allowing user customization, just not at the cost of clarity or mental load. What it means for you If you use ChatGPT for studying, writing, planning or productivity, you may soon notice: More focused replies Less chatty filler or small talk Smarter task completion and summaries Shorter, more efficient interactions when needed The takeaway OpenAI's shift is part of a broader trend toward human-centered AI; tools that support your work and well-being without demanding your attention in return. The company's redefined vision for ChatGPT is simple: help people more, distract them less. In a world full of apps designed to hook you, that's a surprisingly radical move and one that could (hopefully) set a new standard for AI design going forward. Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button. More from Tom's Guide Perplexity accused of scraping websites even when told not to — here's their response Grok launches AI image generator with a NSFW 'spicy mode' — it's exactly what you'd expect just launched an AI-powered social feed — and it's like TikTok meets ChatGPT

How Artificial Intelligence Reasons

Hashtags

Try Our AI Features

Comments

Related Articles

ChatGPT-5 is delayed — but these 5 features could make the wait worth it

OpenAI launches two new AI models ahead of GPT-5 - here's everything you need to know

OpenAI says they are no longer optimizing ChatGPT to keep you chatting — here's why

Get Started Now: Download the App