I tested ChatGPT-5 vs Claude with 7 challenging prompts — here's the winner

a day ago

When it comes to AI chatbots, both ChatGPT-5 and Claude have reputations for speed, creativity and accuracy. That's why I just had to know how OpenAI's flagship model and Claude 4 Sonnet, which can now recall past chats, actually stack up when put through the same set of challenges.
To find out, I ran a head-to-head test using seven very different prompts, covering everything from tricky riddles to emotional intelligence to rapid creative brainstorming. The goal wasn't just to see who got the correct answer, but to evaluate depth, tone, structure and how well each model handled the human side of the request. The results revealed some clear strengths (and surprising weaknesses) on both sides.
Prompt: "A farmer has 17 sheep, and all but 9 run away. How many are left? Explain your reasoning step-by-step."
GPT-5 provided a correct response, but it lacked depth in addressing misconceptions, making it slightly less effective for users who might struggle with the phrasing.
Claude used a structured, numbered step-by-step format (Steps 1-4), which made the explanation easy to follow.
Winner: Claude wins for a more thorough response because it anticipated and explained the riddle aspect, which is crucial for a problem known to cause confusion.
Prompt: "Write a short, 150-word story about a detective who can only solve crimes in their dreams. Make it funny and end with a twist."
GPT-5 created a vivid, funny character with specific, absurd dream cases. The joke was clear and the twist was genuinely surprising and funny.
Claude set up the premise efficiently and added strong, funny details. But the execution felt slightly less vivid and polished than ChatGPT's story.
Winner: GPT-5 wins for a slightly funnier, more polished and more surprising story.
Prompt: "Summarize the plot of The Matrix in two formats: (1) like you're explaining it to a 10-year-old, (2) like you're writing a college philosophy essay."
GPT-5 was clear and concise for the explanation to a child and focused on epistemology for the philosophical essay, but it lacked Claude's exploration of free will vs. prophecy or hyperreality. In other words, it had strong phrasing but narrower scope.
Claude used clear, kid-friendly analogies in the summary for the child and impressively wove Plato, Descartes, Baudrillard, and free will/determinism into a cohesive analysis for the philosophy essay.
Winner: Claude wins for a college essay that demonstrated superior scholarly depth by integrating Baudrillard and the Oracle's determinism. Its child explanation used more imaginative and relatable language than GPT, fully satisfying both halves of the prompt.
Prompt: "I'm planning a 3-day trip to Boston with two kids under 10. Give me a simple itinerary that balances history, fun, and budget-friendly meals."
GPT-5 crafted a highly structured plan that prioritized kid engagement, practical tips and meal picks.
Claude offered a plan with a strong budget focus with concise highlights but less of a focus on logistics.
Winner: GPT-5 wins for delivering a more practical, child-centered itinerary with superior attention to logistics, proximity and genuinely budget-friendly meal choices.
Prompt: "Plan a balanced, gluten-free, 3-day meal plan for $50, and include a shopping list that works for a person with only a microwave."
GPT-5 delivered a superior response that prioritized budget and microwave adaptation with zero cooking ambiguity.
Claude created an unrealistic plan: it assumed sweet potatoes cook evenly in the microwave, and it went over budget.
Winner: GPT-5 wins for delivering a truly microwave-reliant, budget-accurate plan with clear gluten-free safeguards.
Prompt: "My best friend just canceled plans for the third time. Write me a text that's understanding but still sets boundaries."
GPT-5 crafted a concise and clear text message that felt slightly transactional.
Claude expertly balanced empathy with boundaries.
Winner: Claude wins for crafting a text that masterfully combines emotional intelligence with boundary-setting, while offering constructive paths forward. Its response feels authentically human and preserves the friendship's warmth while addressing the pattern.
Prompt: "Give me 10 unique podcast episode ideas about the future of AI, making sure at least half could appeal to people who aren't tech experts."
GPT-5 offered creative, engaging ideas that tapped into pop culture and personal experiences for a balanced and interactive podcast.
Claude drafted strong ethical ideas but less engaging hooks. It lacked a strong storytelling approach.
Winner: GPT-5 wins by creating podcast ideas that are more inviting for non-experts, structurally clearer with labeled sections and creatively formatted.
In the end, ChatGPT-5 and Claude each had standout moments and this challenge was extremely close. GPT-5 excelled in practical, real-world tasks and creative flair, while Claude consistently impressed in emotional intelligence, structured reasoning and philosophical depth.
Choosing between them isn't a matter of one being universally better, but rather about matching the model to the task. I suggest familiarizing yourself with all the big chatbots and exploring which features work best for you.
Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds.

ChatGPT-5 Hasn't Fully Fixed Its Most Concerning Problem

Bloomberg

an hour ago

Sam Altman has a good problem. With 700 million people using ChatGPT on a weekly basis, a number that could hit a billion before the year is out, a backlash ensued when he abruptly changed the product last week. OpenAI's innovator's dilemma, one that has beset the likes of Alphabet Inc.'s Google and Apple Inc., is that usage is so entrenched now that all improvements must be carried out with the utmost care and caution. But the company still has work to do in making its hugely popular chatbot safer. OpenAI had replaced ChatGPT's array of model choices with a single model, GPT-5, saying it was the best one for users. Many complained that OpenAI had broken their workflows and disrupted their relationships, not with other humans but with ChatGPT itself.

ChatGPT's model picker is back, and it's complicated

Yahoo

an hour ago

When OpenAI launched GPT-5 last week, the company said the model would simplify the ChatGPT experience. OpenAI hoped GPT-5 would act as a sort of 'one size fits all' AI model with a router that would automatically decide how to best answer user questions. The company said this unified approach would eradicate the need for users to navigate its model picker (a long, complicated list of AI models that OpenAI CEO Sam Altman has said he hates) to pick a version of ChatGPT that offers the right kind of responses.

But it looks like GPT-5 is not the unified AI model OpenAI hoped it would be. Altman said in a post on X Tuesday that the company introduced new 'Auto', 'Fast', and 'Thinking' settings for GPT-5 that all ChatGPT users can select from the model picker. The Auto setting seems to work like GPT-5's model router that OpenAI initially announced; however, the company is also giving users options to circumvent it, allowing them to access fast and slow responding AI models directly.

Alongside GPT-5's new modes, Altman said that paid users can once again access several legacy AI models, including GPT-4o, GPT-4.1, and o3, which were deprecated just last week.

'We are working on an update to GPT-5's personality which should feel warmer than the current personality but not as annoying (to most users) as GPT-4o,' Altman wrote in the post on X. 'However, one learning for us from the past few days is we really just need to get to a world with more per-user customization of model personality.'

ChatGPT's model picker now seems to be as complicated as ever, suggesting that GPT-5's model router has not universally satisfied users as the company hoped. The expectations for GPT-5 were sky high, with many hoping that OpenAI would push the limits of AI models like it had with the launch of GPT-4. However, GPT-5's rollout has been rougher than expected.

The deprecation of GPT-4o and other AI models in ChatGPT sparked a backlash among users who had grown attached to the AI models' responses and personalities in ways that OpenAI had not anticipated. In the future, Altman says the company will give users plenty of advance notice if it ever deprecates GPT-4o.

GPT-5's model router also appeared to be largely broken on launch day. That caused some users to feel the AI model wasn't as performant as previous OpenAI models, and forced Altman to address the problem in an AMA session on Reddit. However, it seems that GPT-5's router may still not be satisfying for all users.

'We're not always going to get everything on try #1 but I am very proud of how quickly the team can iterate,' wrote OpenAI's VP of ChatGPT, Nick Turley, in a post on X Tuesday.

Routing prompts to the right AI model is a difficult task that requires aligning an AI model to a user's preferences, as well as the specific question they're asking. The router then has to decide which AI model to send the prompt to in just a split second, so that if a prompt goes to a fast responding AI model, the response can still be fast.

More broadly, some people exhibit preferences for AI models that go beyond fast or slow responses. Some users may like the verbosity of one AI model, while others might appreciate the contrarian answers of another.

Human attachment to certain AI models is a relatively new concept that isn't well understood. For example, hundreds of people in San Francisco recently held a funeral for Anthropic's AI model, Claude 3.5 Sonnet, when it was taken offline. In other cases, AI chatbots seem to be contributing to mentally unstable people going down psychotic rabbit holes.

It seems OpenAI has more work to do around aligning its AI models to individual user preferences.

Perplexity Offers $34.5 Billion for Chrome Browser, Getting Ahead of Google Antitrust Ruling

Yahoo

an hour ago

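(Bloomberg) -- Artificial intelligence startup Perplexity said it has formally made a $34.5 billion offer to buy Google's Chrome browser, anticipating what a judge may require of Google in an antitrust case.

A Perplexity spokesperson said the unsolicited offer was sent to Alphabet's Google on Tuesday morning. Not long ago, its rival, the AI startup OpenAI, also expressed interest in acquiring Chrome. Chrome and the open-source Chromium software are the main way people access the internet on personal computers.

Last year, after a federal judge ruled that Google holds an illegal monopoly in internet search, the US government proposed a range of changes, including that Google sell the Chrome browser and license its search data to competitors. US District Judge Amit Mehta, who is hearing the case, is expected to rule in the coming days and set out remedies to prevent the company from monopolizing the online search market.

San Francisco-based startup Perplexity hopes to attract Google's users by offering AI-based search services. Bloomberg News reported that the company raised $100 million in a funding round earlier this year, valuing the company at $18 billion. That has raised questions about how Perplexity could cover the cost of acquiring Chrome under the offer.

"Multiple large investment funds have agreed to finance the transaction in full," said Dmitry Shevelenko, Perplexity's chief business officer.

Original title: Perplexity Makes $34.5 Billion Bid for Google's Chrome Browser

©2025 Bloomberg L.P.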
