logo
The latest ChatGPT is supposed to be ‘PhD level' smart. It can't even label a map

The latest ChatGPT is supposed to be ‘PhD level' smart. It can't even label a map

Yahoo2 days ago
A version of this story appeared in CNN Business' Nightcap newsletter. To get it in your inbox, sign up for free here.
Sam Altman, the artificial intelligence hype master, is in damage-control mode.
OpenAI's latest version of its vaunted ChatGPT bot was supposed to be 'PhD-level' smart. It was supposed to be the next great leap forward for a company that investors have poured billions of dollars into.
Instead, ChatGPT got a flatter, more terse personality that can't reliably answer basic questions. The resulting public mockery has forced the company to make sweaty apologies while standing by its highfalutin claims about the bot's capabilities.
In short: It's a dud.
The misstep on the model, called GPT-5, is notable for a couple of reasons.
1. It highlighted the many existing shortcomings of generative AI that critics were quick to seize on (more on that in a moment, because they were quite funny).
2. It raised serious doubts about OpenAI's ability to build and market consumer products that human beings are willing to pay for. That should be particularly concerning for investors, given OpenAI, which has never turned a profit, is reportedly worth $500 billion.
Let's rewind a bit to last Thursday, when OpenAI finally released GPT-5 to the world — about a year behind schedule, according to the Wall Street Journal. Now, one thing this industry is really good at is hype, and on that metric, CEO Sam Altman delivered.
During a livestream ahead of the launch last Thursday, Altman said talking to GPT-5 would be like talking to 'a legitimate PhD-level expert in anything, any area you need.'
In his typically lofty style, Altman said GPT-5 reminds him of 'when the iPhone went from those giant-pixel old ones to the retina display.' The new model, he said, is 'significantly better in obvious ways and subtle ways, and it feels like something I don't want to ever have to go back from,' Altman said in a press briefing.
Then people started actually using it.
Users had a field day testing GPT-5 and mocking its wildly incorrect answers.
The journalist Tim Burke said on Bluesky that he prompted GPT-5 to 'show me a diagram of the first 12 presidents of the United States with an image of their face and their name under the image.'
The bot returned an image of nine people instead, with rather creative spellings of America's early leaders, like 'Gearge Washingion' and 'William Henry Harrtson.'
A similar prompt for the last 12 presidents returned an image that included two separate versions of George W. Bush. No, not George H.W. Bush, and then Dubya. It had 'George H. Bush.' And then his son, twice. Except the second time, George Jr. looked like just some random guy.
Labeling basic maps of the United States also proved tricky for GPT-5 (but again, pretty funny, as tech writer Ed Zitron's post on Bluesky showed).
GPT-5 did slightly better when I asked it on Wednesday for a map of the US. Some people can, in fact, label the great state of Vermont correctly without a PhD, but not GPT-5. And this is the first I'm hearing of states named 'Yirginia.'
The slop coming out of GPT-5 was funny when it was just us nerds trying to find its blind spots. But some regular fans of ChatGPT weren't laughing. Especially because users have been particularly alarmed by the new version's personality – or rather, lack thereof.
In rolling out the new model, OpenAI essentially retired its earlier models, including the wildly popular GPT-4o that's been on the market for over a year, making it so that even people who loved the previous iteration of the chatbot suddenly couldn't use it. More than 4,000 people signed a Change.org petition to compel OpenAI to resurrect it.
'I'm so done with ChatGPT 5,' one user wrote on Reddit, explaining how they tried to use the new model to run 'a simple system' of tasks that an earlier ChatGPT model used to handle. The user said GPT-5 'went rogue,' deleting tasks and moving deadlines.
And while OpenAI's defenders could chalk that up to an isolated or even made-up incident, within 24 hours of the GPT-5 launch Altman was doing damage control, seemingly caught of guard by the bad reception. On X, he announced a laundry list of updates, including the return of GPT-4o for paid subscribers.
'We expected some bumpiness as we roll out so many things at once,' Altman said in a post. 'But it was a little more bumpy than we hoped for!'
The CEO's failure to anticipate the outrage suggests he doesn't have a firm grasp on how an estimated 700 million weekly active users are engaging with his product.
Perhaps Altman missed all the coverage — from CNN, the New York Times, the Wall Street Journal — of people forming deep emotional attachments to ChatGPT or rival chatbots, having endless conversations with them as if they were real people. A simple search of Reddit could have offered insights into how others are integrating the tool into their workflows and lives. Basic market research should have shown OpenAI that a mass update sunsetting the tools people rely on would be more than just a bit bumpy.
When asked about the backlash to GPT-5, an OpenAI representative pointed CNN to Altman's public statements on social media announcing the return of older models, as well as a blog post about how the company is optimizing GPT-5.
The messy rollout speaks to how the AI industry as a whole is struggling to prove themselves as producers of consumer goods rather than 'labs' — as they love to call themselves, because it sounds more scientific and distracts people from the fact that they are backed by people who are trying to make unfathomable amounts of cash for themselves.
AI companies often base their fanfare around how a model performs in various behind-the-scenes benchmark tests that show how well a bot can do complex math. For all we know, GPT-5 sailed through those evaluations.
But the problem is that OpenAI hyped the thing so far into the stratosphere, disappointment was (or should have been) inevitable.
'I honestly didn't think OpenAI would burn the brand name on something so mid,' wrote prominent researcher and AI critic Gary Marcus. 'In a rational world, their valuation would take a hit,' he added, noting OpenAI still hasn't turned a profit, is slashing prices to keep its user numbers up, and is hemorrhaging talent as competition heats up.
For critics like Marcus, the GPT-5 flop was a kind of vindication. As he noted in a blog post, other models like Elon Musk's Grok aren't faring much better, and the backlash from even AI proponents feels like a turning point.
When people talk about AI, they're talking about one of two things: the AI we have now — chatbots with limited, defined utility — and the AI that companies like Altman's claim they can build — machines that can outsmart humans and tell us how to cure cancer, fix global warming, drive our cars and grow our crops, all while entertaining and delighting us along the way.
But the gap between the promise and the reality of AI only seems to widen with every new model.
CNN's Lisa Eadicicco contributed reporting.
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Discover Yourself Named to Inc. 5000 for Second Time
Discover Yourself Named to Inc. 5000 for Second Time

Yahoo

time8 minutes ago

  • Yahoo

Discover Yourself Named to Inc. 5000 for Second Time

Using the universal language of color, the company helps organizations worldwide build stronger teams, improve communications and accelerate performance EXCELSIOR, Minn., August 16, 2025--(BUSINESS WIRE)--Discover Yourself, the world's largest distributor of the Insights Discovery® personality assessment, has once again been named to the Inc. 5000 list of America's Fastest-Growing Private Companies. This second-time honor places the Minnesota-based training firm alongside some of the country's most innovative and dynamic businesses. The Inc. 5000 list is widely regarded as the hallmark of entrepreneurial success, celebrating innovation, customer impact and sustained revenue growth. Earning a spot on the list twice reflects Discover Yourself's strong performance, and its enduring ability to adapt and deliver value in a rapidly changing workplace environment. Easily the most colorful company on the Inc. 5000 list Founded by internationally recognized speaker, author and trainer Scott Schwefel, Discover Yourself transforms communication and teamwork by teaching the universal language of color. Through the Insights Discovery framework, individuals identify their unique blend of Fiery Red, Sunshine Yellow, Earth Green and Cool Blue energies, unlocking self-awareness and awareness of others. This approach enables participants to reduce misunderstandings, build stronger connections and collaborate more effectively. "Most people who train with us have done other personality assessments but rarely use them. With ours, they start applying it immediately because every conversation, every email and every interaction gets better," says Schwefel. Discover Yourself's programs — ranging from leadership development and sales training to executive coaching and team-building workshops — are trusted by global brands such as Caterpillar, 3M, Whirlpool, Zendesk and Workday. The company is actively expanding its reach with new online learning platforms, scalable virtual workshops and digital resources to make its programs accessible to teams anywhere in the world. Its reach spans all 50 states and more than 30 countries, powered in part by newly launched digital clones that can deliver on-demand training in 170 languages. Schwefel, one of only 60 global faculty certified to train other trainers, has personally taken 4,000 CEOs through Insights Discovery training, and has spoken to more than 2000 groups. His TED Talk has nearly four million views. He is also a popular keynote speaker worldwide. "Being named to the Inc. 5000 is more than an award. It's proof our methods work, and that organizations everywhere are hungry for meaningful change," comments Schwefel. Discover Yourself has grown from a two-person operation to 17 employees, who support a global network of more than 300 Insights Discovery certified facilitators. Clients reporting measurable boosts in collaboration, productivity and leadership capability. Visit Discover Yourself to take a one-minute quiz to discover where you fall in the color model. If you'd never experienced Insights Discovery and would like to learn more, contact Scott@ View source version on Contacts Scott@

Piper Sandler Reiterates a Buy Rating on UnitedHealth Group (UNH)
Piper Sandler Reiterates a Buy Rating on UnitedHealth Group (UNH)

Yahoo

time8 minutes ago

  • Yahoo

Piper Sandler Reiterates a Buy Rating on UnitedHealth Group (UNH)

UnitedHealth Group Incorporated (NYSE:UNH) is one of the best stocks to invest in for beginners. Piper Sandler analyst Jessica Tassan reiterated a Buy rating on UnitedHealth Group Incorporated (NYSE:UNH) on August 5, setting a price target of $280.00. A senior healthcare professional giving advice to a patient in a clinic. UnitedHealth Group Incorporated (NYSE:UNH) reported that its Q2 2025 revenue underwent a $12.8 billion year-over-year growth to $111.6 billion, attributed primarily to growth within Optum and UnitedHealthcare. Earnings from operations for the quarter were $5.2 billion, while adjusted net earnings were $4.08 per share. Management reported a full year 2025 revenue outlook in the $445.5 billion to $448.0 billion range, with a full year 2025 earnings outlook of at least $14.65 per share. UnitedHealth Group Incorporated (NYSE:UNH) provides healthcare coverage, data consultancy, and software services. It operates through the OptumRx, OptumInsight, OptumHealth, and UnitedHealthCare segments, which have solid operations. While we acknowledge the potential of UNH as an investment, we believe certain AI stocks offer greater upside potential and carry less downside risk. If you're looking for an extremely undervalued AI stock that also stands to benefit significantly from Trump-era tariffs and the onshoring trend, see our free report on the best short-term AI stock. READ NEXT: 30 Stocks That Should Double in 3 Years and 11 Hidden AI Stocks to Buy Right Now. Disclosure: None. This article is originally published at Insider Monkey. Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

Jim Cramer on International Business Machines: 'I Think They Have the Lead in Quantum'
Jim Cramer on International Business Machines: 'I Think They Have the Lead in Quantum'

Yahoo

time8 minutes ago

  • Yahoo

Jim Cramer on International Business Machines: 'I Think They Have the Lead in Quantum'

International Business Machines Corporation (NYSE:IBM) is one of the stocks Jim Cramer shed light on. A caller asked for Cramer's thoughts on the company, and he remarked: 'Okay, I didn't think IBM's quarter… was all that bad at all. I think you have a major opportunity down here because I think that we're going to start talking about IBM and quantum. I think they have the lead in quantum, and I think quantum really does matter. They have a great software package. They're doing so many things that are good. Tauke / International Business Machines Corporation (NYSE:IBM) provides integrated solutions across software, consulting, infrastructure, and financing. The company offers hybrid cloud and AI platforms, technology services, and IT financing solutions. While we acknowledge the potential of IBM as an investment, we believe certain AI stocks offer greater upside potential and carry less downside risk. If you're looking for an extremely undervalued AI stock that also stands to benefit significantly from Trump-era tariffs and the onshoring trend, see our free report on the best short-term AI stock. READ NEXT: 30 Stocks That Should Double in 3 Years and 11 Hidden AI Stocks to Buy Right Now. Disclosure: None. This article is originally published at Insider Monkey.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store