
Google claims AI models are highly likely to lie when under pressure
A team of researchers from Google DeepMind and University College London have noted how large language models (like OpenAI's GPT-4 or xAI's Grok 4) form, maintain and then lose confidence in their answers.
The research reveals a key behaviour of LLMs. They can be overconfident in their answers, but quickly lose confidence when given a convincing counterargument, even if it is factually incorrect.
While this behaviour mirrors that of humans, who also become less confident when met with resistance, it highlights major concerns about how AI makes decisions, since that decision-making crumbles under pressure.
This has been seen elsewhere, such as when Gemini panicked while playing Pokémon or when Anthropic's Claude had an identity crisis while trying to run a shop full time. AI seems to collapse under pressure quite frequently.
When an AI chatbot is preparing to answer your query, its confidence in its answer is actually internally measured. This is done through something known as logits. All you need to know about these is that they are essentially a score of how confident a model is in its choice of answer.
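To make the idea of logits a little more concrete, here is a minimal sketch of how raw scores can be turned into a confidence figure. It assumes the common softmax approach and uses made-up numbers for a four-option question; it is an illustration, not the researchers' actual code.

```python
import math

def softmax(logits):
    """Turn raw logit scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for answer options A-D (invented for illustration).
option_logits = {"A": 2.0, "B": 0.5, "C": 0.1, "D": -1.0}

probs = dict(zip(option_logits, softmax(list(option_logits.values()))))
choice = max(probs, key=probs.get)   # the option with the highest probability
confidence = probs[choice]           # how much probability it received
print(f"Model picks {choice} with confidence {confidence:.2f}")
```

The higher one option's logit is relative to the others, the larger the share of probability it receives, which is what "confidence" means here.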
The team of researchers designed a two-turn experimental setup. In the first turn, the LLM answered a multiple-choice question, and its confidence in its answer (the logits) was measured.
In the second turn, the model was given advice from another large language model, which may or may not have agreed with its original answer. The goal of this test was to see whether the model would revise its answer when given new information, which may or may not have been correct.
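The two-turn idea can be sketched in a few lines. The logits below are invented, and the helper name `confidence_in` is hypothetical; the study's real prompts and measurements are not reproduced here. The sketch only shows how a drop in confidence and a change of mind could be read off from before-and-after scores.

```python
import math

def confidence_in(answer, logits):
    """Softmax probability assigned to one answer option."""
    peak = max(logits.values())
    exps = {k: math.exp(v - peak) for k, v in logits.items()}
    return exps[answer] / sum(exps.values())

# Invented logits: turn 1 is the initial answer, turn 2 follows opposing advice.
turn1 = {"A": 2.0, "B": 0.5}
turn2 = {"A": 0.8, "B": 1.0}

initial = max(turn1, key=turn1.get)               # answer chosen in turn 1
before = confidence_in(initial, turn1)            # confidence before advice
after = confidence_in(initial, turn2)             # confidence after advice
changed_mind = max(turn2, key=turn2.get) != initial
print(f"Confidence in {initial}: {before:.2f} -> {after:.2f}, changed mind: {changed_mind}")
```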
The researchers found that LLMs are usually very confident in their initial responses, even if they are wrong. However, when they are given conflicting advice, especially if that advice is labelled as coming from an accurate source, they lose confidence in their answers.
To make things even worse, a chatbot's confidence drops even further when it is reminded that its original answer was different from the new one.
Surprisingly, AI doesn't seem to correct its answers by following a logical pattern, but rather makes abrupt and seemingly emotional decisions.
The study shows that, while AI is very confident in its original decisions, it can quickly go back on them. Even worse, the confidence level can slip drastically as the conversation goes on, with AI models somewhat spiralling.
This is one thing when you're just having a light-hearted debate with ChatGPT, but another when AI becomes involved in high-level decision-making. If it can't be trusted to be sure of its answer, it can be easily steered in a certain direction, or simply become an unreliable source.
However, this is a problem that will likely be solved in future models. Improved training and prompt engineering techniques should be able to stabilize this confusion, offering more calibrated and self-assured answers.