
OpenAI's Latest ChatGPT AI Models Are Smarter, But They Hallucinate More Than Ever
Artificial intelligence is evolving fast, but not always in the right direction. OpenAI's latest models, GPT o3 and o4-mini, were built to mimic human reasoning more closely than ever before.
However, a recent internal investigation reveals an alarming downside: these models may be more intelligent, but they're also more prone to making things up. Hallucination in AI is a Growing Problem OpenAI's Latest ChatGPT AI Models Are Smarter, But They Hallucinate
Since the birth of chatbots, hallucinations, also known as false or imaginary facts, have been a persistent issue. With each model iteration, the hope was that these AI hallucinations would decline. But OpenAI's latest findings suggest otherwise, according to The New York Times.
In a benchmark test focused on public figures, GPT-o3 hallucinated in 33% of responses, twice the error rate of its predecessor, GPT-o1. Meanwhile, the more compact GPT o4-mini performed even worse, hallucinating nearly half the time (48%). Reasoning vs. Reliability: Is AI Thinking Too Hard?
Unlike previous models that were great at generating fluent text, o3 and o4-mini were programmed to reason step-by-step, like human logic. Ironically, this new "reasoning" technique might be the problem. AI researchers say that the more a model does reasoning, the more likely it is to go astray.
Unlike low-flying systems that stay with secure, high-confidence responses, these newer systems attempt to bridge between complicated concepts, which can cause bizarre and incorrect conclusions.
On the SimpleQA test, which tests general knowledge, the performance was even worse: GPT o3 hallucinated on 51% of responses, while o4-mini shot to an astonishing 79%. These are not small errors; these are huge credibility gaps. Why More Sophisticated AI Models May Be Less Credible
OpenAI attributes the rise in AI hallucinations to possibly not being the result of the reasoning itself, but of the verbosity and boldness of the models. While attempting to be useful and comprehensive, the AI begins to guess and sometimes mixes theory with fact. The outcome will sound very convincing, but they're entirely incorrect answers.
According to TechRadar, this becomes especially risky when AI is employed in high-stakes environments such as law, medicine, education, or government service. A single hallucinated fact in a legal brief or medical report could have disastrous repercussions. The Real-World Risks of AI Hallucinations
We already know attorneys were sanctioned for providing fabricated court citations produced by ChatGPT. But what about minor mistakes in a business report, school essay, or government policy memo? The more integrated AI becomes into our everyday routines, the fewer opportunities there are for error.
The paradox is simple: the more helpful AI is, the more perilous its mistakes are. You can't save people time if they still need to fact-check everything. Treat AI Like a Confident Intern
Though GPT o3 and o4-mini demonstrate stunning skills in coding, logic, and analysis, their propensity to hallucinate means users can't rely on them when they require rock-solid facts. Until OpenAI and its rivals are able to minimize these hallucinations, users need to take AI output with a grain of salt.
Consider it this way: These chatbots are similar to that in-your-face co-worker who always has a response, but you still fact-check everything they state.
Originally published on Tech Times

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Int'l Business Times
4 hours ago
- Int'l Business Times
You're Already Using Voice AI at Work — Here's How It's Changing Everything
Welcome to Tech Times' AI EXPLAINED , where we look at the tech of today and tomorrow. Brought to you by Using voice AI at work is already happening. You might think AI at work means typing prompts into ChatGPT or getting a slick summary from your inbox. But the real shift is happening in your ears. From whispered prompts during Zoom calls to voice bots taking fast food orders, AI is quietly changing how we work, and you might not even realize it's there. Voice AI isn't just a flashy gadget or customer service gimmick. It's becoming a built-in layer across industries: transcribing your meetings, answering your calls, and even drafting your doctor's notes. As tools like Microsoft Copilot, and drive-thru voice bots quietly embed themselves in daily workflows, they're not only reshaping tasks, they're shifting what it means to show up, speak up, and stay relevant in a workplace that's learning to listen. Game studio founder and voice AI advisor Mike Sorrenti is bullish on the tech. "Voice ai is an excellent thing. It can be used for translation and many other things and if a very natural interface for kids, and older adults especially those with mild disabilities such as arthritis," he said in an email. Enhancing Workplace Productivity with Voice AI Visitors interact with Microsoft Copilot demos at a tech event, where AI-powered assistants like voice-enabled copilots are reshaping how we collaborate at work. A recent UK government study showed that civil servants who used Microsoft's Copilot AI for administrative tasks saved an average of 26 minutes a day, which works out to about two weeks of gained time per year. Voice AI plays a key role in this shift from the pre-AI workplace to the current AI workplace, becoming useful in transcription, summarization, and virtual meeting assistants. Previously, you'd have to have a human assistant sit in on your Zoom or Microsoft Teams meetings to take notes, or delegate someone who may or may not be good at live note-taking to manage these. Voice AI assistants can now log, transcribe and summarize your meetings, often without the people in the meeting really understanding what's being recorded or how it's being analyzed. Workers are already adjusting how and when they speak during meetings, knowing that their words might be captured and re-contextualized by AI summaries. has also introduced a voice-activated AI version of its AI meeting assistant, letting users join and manage meetings without having to take notes. Now that we're used to asking Siri to take a note or set a timer, asking your AI voice agent to start recording will be a cinch. The shift from passive transcription to an interactive agentic AI system is subtle, but it is here. Users can even give verbal commands mid-meeting, obviating the need for someone who understands the user interface of an app, for example, or web controls, all in real-time. This isn't a backend utility, either; rather, tools like Otter are becoming active participants in meetings, giving us a glimpse at a future where AI agents might take on even more collaborative roles. While tools like this save time and offer easier control, the issues of privacy, consent, surveillance, and etiquette in meetings (especially in hybrid or remote work environments can be substantive. But first, a quick casting call for our sponsor and partner of AI EXPLAINED this month, PlaudAI - the No.1 AI note-taker that does exactly what you'd imagine an AI assistant can do in 2025. Do you use voice AI in your daily work? Are you a journalist, doctor, lawyer, educator, or creator doing something smart with your voice? We're spotlighting modern professionals for a new series with our sponsor, — share your story with us here , and you could be featured and receive a sample of PlaudAI. Voice AI Has Already Transformed Customer Service and Retail Experiences A call center employee works with a headset, illustrating how voice-based tools have long been part of the workplace — and how AI is now transforming those interactions. As a consumer, driving through the fast-food lane can be a common experience. You pull up to the speaker, ask for the food you want, and go grab it after you pay at the next window. If you've been in one recently, you may have heard some rudimentary voice AI ask you if you used a mobile ordering app. But that's all about to get even more AI-powered. Sophie Fennelly knows that voice AI is happening now. "As CEO of Sales TQ, with over a decade in enterprise sales and an analytical background from MIT, I've helped small businesses adopt emerging technologies—including voice AI—to improve operations," she wrote in an email. "Voice AI is no longer just futuristic hype; it is actively reshaping how we work and live." Hungry Jack's, a fast food chain in Australia, just began trials of a new AI-driven voice assistant for its own drive-through orders. Customers appreciated the system's politeness while the company likely appreciates the potential for late-night staffing. Employees, though, might start to feel the pinch, or start moving to parts of the job that tech can't yet replicate, like solving problems, connecting with people, or hopping in when the system makes a mistake. Australian franchises of KFC have also started testing out AI-driven voice ordering, though customers there have pushed back a bit, saying they prefer human interaction and their orders being misinterpreted by the AI window jockey. Still, even this workplace is evolving as AI starts to support and replace humans in these high-turnover, low-wage roles. As a consumer, you might have to learn how to talk to AI bots, and as a worker, you'll likely need to up your skills to fit roles that need a more human touch. Revolutionizing Healthcare with Ambient Voice AI A doctor speaks during an interview, highlighting how voice AI tools are increasingly being adopted in healthcare settings — from clinical dictation to patient notes. Microsoft's DAX Copilot automates recordkeeping, updates patient records, and can draft care plans all from just listening in to conversations between doctors and patients. The idea is that physicians can spend more time on patient care than on record keeping, thereby improving health outcomes for more people. Microsoft is also transforming the workplace of nurses, with ambient voice and other AI technologies becoming integrated into nursing workflows, alleviating the burden of so much documentation, reducing burnout, and allowing nurses to spend more time on patient care. Global Reach and No-Code Tools for Voice AI A startup in Bengaluru (formerly Bangalore) in southern India is offering a no-code conversational AI platform that lets other businesses deploy real-time, multilingual voice agents for customer support and services. Ring AI has expanded beyond India to the Middle East and Latin America, supporting languages like Arabic and Spanish. Global workplaces that couldn't afford the high-cost development of their own voice AI systems can now contract with Ringg AI and deploy AI agents to help them with their businesses. While that does mean that AI tools may replace workers at major companies, it does mean that smaller shops can afford the cost of entry to compete on the level of voice AI at larger corporations. Gev Balyan, CEO of Hoory AI says, "Startups and solopreneurs now have an always-on assistant — answering calls, qualifying leads, booking — at enterprise scale, without enterprise cost." What It All Means For You A glowing AI key on a sleek keyboard symbolizes how artificial intelligence — including voice-powered assistants — is becoming a built-in part of everyday work tools. So what does the current adoption of Voice AI mean for you. Whether you're running (or taking) meetings, answering customer support calls, or just trying to get through all the email you receive in a day, Voice AI is likely becoming a bigger part of your day, whether you realize it or not. The tools are changing how we connect with colleagues and higher-ups, the tasks that make up our daily work, and even what kinds of jobs are growing (or shrinking). As an employee, it could mean you'll do fewer routine tasks and spend more time thinking, making decisions, or even managing these AI tools directly. But you'll still need to learn to work with AI, knowing what it's good at, how to manage it when if makes mistakes, and how to communicate with AI tools that are always listening. As a business owner, you might feel like AI will boost your workers' productivity while reducing costs, but you'll also need to keep an eye out on issues of privacy, trust, and training. You may want to start hiring people who have higher levels of judgment, empathy, and communication to be able to work with your new AI systems. Bottom line, AI isn't coming for your job all at once, but it is coming to change parts of your job already. The best thing here is to stay relevant by focusing on what you can do that AI can't, yet: connect with others, understand nuance, and adapt when things go off the rails. Originally published on Tech Times


DW
7 hours ago
- DW
India is reaching for its own world-class AI engine – DW – 06/05/2025
Despite thousands of AI startups cropping up in India, the nation has yet to produce its own premium AI engine. Why is there no Indian version of DeepSeek or ChatGPT? With over 5 million IT workers and a growing focus on artifical intelligence (AI) in education, Indiaseems to be in an ideal position for what is shaping up to be a global race in AI technology. But while the US set the global AI standard with ChatGPT in 2023, and China quickly closed the gap with its powerful chatbot DeepSeek, India has yet to produce an equivalent Large Language Model (LLM) that can mimic human communication. And it's not for the lack of ambition. According to data from market intelligence firm Tracxn, the Indian AI sector includes 7,114 startups that have collectively raised $23 billion (€20.15 billion) in equity funding so far. Last year, Prime Minister Narendra Modi's Cabinet approved the IndiaAI Mission initiative with a budget of nearly $1.21 billion, aiming to "undertake the development and deployment of indigenous Large Multimodal Models (LMMs) and domain-specific foundational models in critical sectors." DeepSeek: Cheap AI model shakes up tech industry To view this video please enable JavaScript, and consider upgrading to a web browser that supports HTML5 video This week, IndiaAI Mission CEO Abhishek Singh said Indian startups need to think beyond their home turf to compete and succeed against global giants. "They will have to ultimately compete with the best in the world," Singh said at the Accel AI Summit in Bengaluru. "Initial level of support may come from the government, but that will not sustain them in the long run." "They will have to have a global vision in mind when they are training models," Singh said. AI development needs industry, government and academia Representatives of the National Association of Software and Service Companies (NASSCOM), the voice of India's $283 billion tech industry, point out that building a globally recognized AI model is a complex, resource-intensive process. "The argument is not whether India can catch up, but whether we can move fast enough and define an AI identity on our own," Satyaki Maitra, senior manager communications of NASSCOM, told DW. Last week, IndiaAI Mission announced the addition of 15,916 Graphics Processing Units (GPUs), which are essential for AI research due to their capacity to perform calculations in parallel. The latest boost will bring the total national AI computing capacity to 34,333 GPUs through public-private partnerships. NVIDIA profits up, despite trade war challenges To view this video please enable JavaScript, and consider upgrading to a web browser that supports HTML5 video Startups such as Gan AI, Gnan AI, SarvamAI and Soket AI, supported by the IndiaAI Mission, are building foundational models tailored to India, while firms like Sarvam AI, Fractal and CoRover AI are focusing on AI innovation. "However, AI success cannot be achieved through isolated innovation," said Maitra. "It requires cohesive collaboration between government, industry, and academia to build the full value chain, from compute and data governance to model training and real-world deployment." What is holding Indian AI back? Pawan Duggal, the country's foremost cybersecurity expert, told DW that India is likely to face a shortage of high-end AI hardware, limited access to advanced GPUs, and insufficient cloud computing resources, which are essential for training large-scale AI models. "There is also a significant investment shortfall when compared to global peers. While venture capital investment in Indian AI startups has increased, it remains a fraction of what is seen in the US or China," said Duggal. "The US invested $2.34 trillion and China $832 billion in ventures and startups from 2014 to 2023, while India invested $145 billion in the same period," he added. Can AI blackmail us? Facts vs. fear To view this video please enable JavaScript, and consider upgrading to a web browser that supports HTML5 video Duggal believes that India is already moving towards creating its own AI model, but it has yet to address significant challenges including infrastructure, funding, talent, data, and regulation. 'Brains are plenty in India' Another issue facing Indian engineers is the diversity of languages in India, with English being just one of 22 official languages in the world's most populous country. Furthermore, official languages make up only a tiny faction of over 1,600 languages spoken within its borders. "The only use-case of an 'Indian' LLM is if it works in our various languages which is difficult as of now given that there is a lack of quality data for LLMs to train on in most Indian languages," Yash Shah of Momentum 91, a leading custom software development company, told DW. "For an LLM in English, there are other companies and countries which are far ahead of us and would continue to be that way," said Shah. 'Godfather of AI' urges tougher government rules To view this video please enable JavaScript, and consider upgrading to a web browser that supports HTML5 video However, Utpal Vaishnav of Upsquare Technologies, a global technology holding company, says the real obstacles are actually "risk-shy investors, patchy data rules, and tight GPU supply." "Brains are plenty in India. GPUs are on the way and our multilingual data is waiting to be shaped. Give this talent, patient capital and clear problems and a compact, world-class LLM can be launched in two, three years," Vaishnav told DW. Edited by: Darko Janjevic


Int'l Business Times
2 days ago
- Int'l Business Times
AI Startup Backed by Microsoft Revealed to Be 700 Indian Employees Pretending to Be Chatbots
A once-hyped AI startup backed by Microsoft has filed for bankruptcy after it was revealed that its so-called artificial intelligence was actually hundreds of human workers in India pretending to be chatbots. a London-based company previously valued at $1.5 billion, marketed its platform as an AI-powered solution that made building apps as simple as ordering pizza. Its virtual assistant, "Natasha," was supposed to generate software using artificial intelligence. In reality, nearly 700 engineers in India were manually coding customer requests behind the scenes, the Times of India reported. The ruse began to collapse in May when lender Viola Credit seized $37 million from the company's accounts, uncovering that had inflated its 2024 revenue projections by 300%. An audit revealed the company generated just $50 million in revenue, far below the $220 million it claimed to investors. A Wall Street Journal report from 2019 had already questioned AI claims, and a former executive sued the company that same year for allegedly misleading investors and overstating its technical capabilities. Despite that, the company raised over $445 million from big names including Microsoft and the Qatar Investment Authority. collapse has triggered a federal investigation in the U.S., with prosecutors in New York requesting financial documents and customer records. Founder Sachin Dev Duggal stepped down earlier this year and was replaced by Manpreet Ratia, who reportedly uncovered the company's internal misrepresentations. The company now owes millions to Amazon and Microsoft in cloud computing costs and has laid off around 1,000 employees. On LinkedIn , the company announced its entry into insolvency proceedings, citing "historic challenges and past decisions" that strained its finances. The fallout is seen as one of the biggest failures of the post-ChatGPT AI investment boom and has renewed scrutiny of "AI washing"—the trend of rebranding manual services as artificial intelligence to secure funding. Originally published on Latin Times