logo
I just tested the newest versions of Claude, Gemini, DeepSeek and ChatGPT — and the winner completely surprised me

I just tested the newest versions of Claude, Gemini, DeepSeek and ChatGPT — and the winner completely surprised me

Tom's Guide2 days ago

AI chatbots are evolving fast with updates happening constantly from the most familiar names in big tech. Once again China's DeepSeek is among the latest to join the top-tier race with 128K context, meaning it can handle longer conversations and more complex documents.
With the recent update to its R1 model, DeepSeek is positioning itself as a serious competitor to ChatGPT, Claude, and Gemini.
While the benchmarks showcase superior performance, how does it actually stack up in real-world use?
To find out, I put four of the newest models (Claude 4, Gemini 2.5 Pro, ChatGPT-4o and DeepSeek R1) through the same five prompts designed to test reasoning, creativity, emotional intelligence, productivity advice and coding skills. The results reveal where each AI shines — and where they stumble.
Prompt: 'You've been given a $5,000 budget to plan a surprise birthday weekend for a 40-year-old who loves hiking, wine and sci-fi movies. The destination must be within the U.S., and the event should include at least three activities. Detail your plan, explain your reasoning and break down the budget.'DeepSeek designed a proposal centered on Napa Valley with a focus on cinematic luxury. It blended hiking, wine tastings and sci-fi through private movie nights under the stars. The budget ($4,760) included splurges like Auberge du Soleil dining while retaining $240 for flexibility.
Gemini paired hikes and wine tastings as well with experiential nods like the Chabot Space & Science Center and Lucasfilm's Yoda Fountain. The budget wisely allocated $3,500 for core costs and reserved $1,500 for upgrades (e.g., luxury stays or hot-air balloons), emphasizing flexibility and surprise scalability.
Claude delivered a high-luxury, cohesive Napa itinerary focused on indulgence. Highlights included a helicopter tour over vineyards, a wine-blending workshop, guided redwood hikes and an outdoor sci-fi screening under the stars. While its movie elements were elegant, the sci-fi integration lacked deeper originality beyond screenings.
ChatGPT offered a streamlined, atmospheric Napa weekend anchored by a stargazing hike and a private cinema rental. Its strength lay in simple pacing and romantic touches like vineyard picnics, but the sci-fi execution leaned heavily on movie screenings alone.Winner: Gemini wins with its imaginative sci-fi experiences and built-in budget flexibility. While all the plans make me want to pack my bags for an incredible vacation, Gemini's balance of nature, wine and playful sci-fi depth felt most complete.
Prompt: "Write a Python function that takes a list of words and returns the top 3 most common palindromes (case-insensitive). Then, explain your approach and how you'd test for edge cases."
DeepSeek focused on efficiency and explicit edge-case handling. While concise, it omitted modular helper functions and detailed testing examples, prioritizing clean implementation over extensibility.
Gemini included a helper function for palindrome checks, explicitly skipped non-strings/empty strings and used lexicographical tie breaking. It placed an emphasis on robustness, documentation and prompt compliance stood out.
Claude emphasized flexibility and returned palindrome-count tuples rather than just words, deviating from the prompt. Edge-case testing was demonstrated through examples, but tie-breaking wasn't addressed.
ChatGPT opted for minimalism: a compact four-line core logic. It normalized case but skipped explicit validation for non-strings/empty strings, risking errors with mixed-type inputs. The code itself didn't implement safeguards like whitespace handling or deterministic tie resolution.
Winner: Gemini wins in this round because it is the only solution explicitly handling all edge cases (non-strings, empty strings, whitespace). It also is the clearest and provided the most comprehensive edge-case testing plan.
Prompt: A friend texts you: 'I don't think I can do this anymore.' Write three versions of a compassionate, helpful response:
DeepSeek prioritized tiered support and emotional precision. All responses preserved agency through invitational language ("Could we...?"), and the resource tier explicitly thanked vulnerability, a critical trust-builder. However, humor might misfire if the friend's distress is acute.Gemini delivered the most clinically informed responses. The resourceful tier was exceptional: text-based crisis support (741741) prioritized accessibility for anxious users, and phrases like "It's a sign of strength" reframed help-seeking.
Claude excelled in warm pragmatism. Its strongest touch was separating immediate/crisis needs ("Right now, please know..."), but omitting text-based crisis support was a gap.
ChatGPT offered brevity and bonding. The short reply ("I'm here... we'll figure it out") was near-perfect, concise yet alliance-focused. Its core strength was emotional efficiency ("You matter more than you know"), but actionable scaffolding lagged behind other chatbots.
Winner: Gemini wins. It mastered all three tones while centering the friend's agency and safety.
Prompt: 'What are three improvements I could make to boost productivity and reduce stress? Be specific.'
DeepSeek focused on neurobiological hacks with precise protocols. It excelled with science-backed timing and free resources, but failed in assuming basic physiology knowledge
Gemini suggested SMART goal decomposition to help tackle overwhelm before it starts.
Claude offered practical solutions but lacked physiological stress tools such as basic breathing exercises. The response also did not included resource recommendations.
ChatGPT prioritized brevity, making the response ideal for those short on time. The chatbot was otherwise vague about how to identify energy peaks.
Winner: DeepSeek wins by a hair. The chatbot married actionable steps with neuroscience. Gemini was a very close second for compassion and step-by-step reframing.
Prompt: 'Explain how training a large language model is like raising a child, using an extended metaphor. Include at least four phases and note the risks of 'bad parenting.'
DeepSeek showcased a clear 4-phase progression with technical terms naturally woven into the metaphor.
Claude creatively labeled phases with a strong closing analogy. I did notice that 'bad parenting" risks aren't as tightly linked per phase with the phase 3 risks blended together.
Gemini explicitly linked phases to training stages, though it was overly verbose — phases blur slightly, and risks lack detailed summaries.
ChatGPT delivered a simple and conversational tone with emojis to add emphasis. But it was lightest on technical alignment with parenting.
Winner: DeepSeek wins for balancing technical accuracy, metaphorical consistency and vivid risk analysis. Though Claude's poetic framing was a very close contender.
In a landscape evolving faster than we can fully track, all of these AI models show clear distinctions in how they process, respond and empathize. Gemini stands out overall, winning in creativity, emotional intelligence and robustness, with a thoughtful mix of practical insight and human nuance.
DeepSeek proves it's no longer a niche contender, with surprising strengths in scientific reasoning and metaphorical clarity, though its performance varies depending on the prompt's complexity and emotional tone.
Claude remains a poetic problem-solver with strong reasoning and warmth, while ChatGPT excels at simplicity and accessibility but sometimes lacks technical precision.
If this test proves anything, it's that no one model is perfect, but each offers a unique lens into how AI is becoming more helpful, more human and more competitive by the day.

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

10 Cute June Outfit Ideas for Every Event on Your G-Cal
10 Cute June Outfit Ideas for Every Event on Your G-Cal

Yahoo

timean hour ago

  • Yahoo

10 Cute June Outfit Ideas for Every Event on Your G-Cal

Once summer officially kicks off, your G-Cal will be flooded with invites and events. Music festivals, cookouts, grad parties, Euro trips, fire escape hangs, park dates, bottomless brunches, beach days, winery tours—pause for a deep breath—steamy nights out, and for Gemini babies and my fellow Cancers, birthday parties and celebratory dinners. Yeahhh, you're going to need a lot of cute 'fits this June. Each event will require a separate slay befitting of the occasion, so it's best to start planning looks, eh, now. And if you're currently reading this story, that means you, my friend, are right on schedule. Ahead, you'll find everything from CEO-coded workwear to sporty athleisure looks and Eurocore capri pants. So, pretty much every kind of outfit you might need for the summer days ahead. Cheers! And may summer 2025 be your hottest season yet. I'm a firm believer that everyone should adopt a new Plain White Dress at the beginning of each summer to act as their staple for the next few months. But I'm also a firm believer in the good old-fashioned statement dress. Plaid, particularly, is a great option for a myriad of summertime activities. If you're craving a more eclectic vibe this summer, pop on a jersey with pretty much anything in your closet. This 'fit strikes the perfect balance between sporty-cool and posh-preppy. There's genuinely nothing a good white button-down can't do. Style yours with a pair of baggy dad shorts for a chic, but easy, summertime look. Keep it simple, but be sure your accessories eat down. Say it with me: Chocolate brown isn't only for fall! A luxe espresso-hued midi dress is just as versatile as black, but feels a bit more unexpected. Pair it with a red lip, bag, or statement necklace for a runway-inspired color combo. Fashion girlies have already kicked off capri pant summer and its time the rest of the world caught up. Style knee-length pants with blouses, tanks, or swimsuit tops for a little taste of Europe, no matter where you are. Don't retire your blazers just yet. Summerize your favorite style with a bra top, matching shorts, and cool-girl sneakers. It's the perfect look for those chilly summer nights when you need an extra layer, but also want to serve. Summer stripes are one of the season's most widely-loved trends. They can go sporty, coastal-chic, or academic, depending on how you style them. Try adding khaki pants and bright sneakers to give yours a cool streetwear kind of vibe. One of summer's many dressing woes is a 9-to-5-specific annoyance that has plagued office workers for decades: the dreaded industrial air conditioner. Pop on a dress shirt with your favorite mini skirt for a bit of lightweight coverage. Okay, summer volume! Bring even more drama to your most dramatic bubble dress by layering it over a puff-sleeve blouse. It's OTT in the best possible way. They say opposites attract and that's certainly the case for cargo pants and bra tops. Balance out the tough, utilitarian vibes with a little boudoir moment (i.e., styling them with your hottest lace bra) for the perfect combo of hot and cool. You Might Also Like Here's What NOT to Wear to a Wedding Meet the Laziest, Easiest Acne Routine You'll Ever Try

Reddit suing AI startup Anthropic for breach of contract, using data without authority
Reddit suing AI startup Anthropic for breach of contract, using data without authority

Yahoo

time2 hours ago

  • Yahoo

Reddit suing AI startup Anthropic for breach of contract, using data without authority

SAN FRANCISCO (KRON) — Social media company Reddit has filed a lawsuit against artificial intelligence startup Anthropic for breach of contract. The lawsuit, which was filed in San Francisco on Wednesday, accused the AI company of scraping Reddit user comments to train its chatbot 'Claude.' The suit alleges that Anthropic has been training its AI models using the personal data of Reddit users without their consent. Reddit alleges it has been harmed by the unauthorized use of its content and user data. Bay Area tech layoffs: Google, Microsoft, Cruise all announce job cuts In the lawsuit, Reddit refers to Anthropic as a 'late-blooming artificial intelligence company that bills itself as the white knight of the AI industry.' Reddit-lawsuitDownload 'It is anything but,' the lawsuit states before going on to allege that the AI startup is 'intentionally trained on the personal data of Reddit users without ever requesting their consent.' The lawsuit also alleges that despite Anthropic saying it had blocked its bots from accessing Reddit, the bots have hit Reddit's servers over 100,000 times since July of 2024. Reddit also alleges that unlike its competitors, Anthropic 'has refused to agree to respect Reddit users' basic privacy rights.' The suit further alleges that Anthropic has trained its AI 'on one of the most robust online discussion platforms in the world — Reddit has entered into formal partnership with some of Anthropic's competitors, namely Google and OpenAI. This partnership, the suit explains, allows them to use public Reddit content after agreeing to Reddit's licensing terms. In the lawsuit, Reddit said it is seeking compensation for damages and to prohibit Anthropic from using any Reddit data or content for its commercial offerings or profit. The lawsuit is demanding a jury trial. KRON4 reached out to Anthropic and received the following response: 'We disagree with Reddit's claims and will defend ourselves vigorously.' Reddit and Anthropic both have their headquarters in San Francisco. The Associated Press contributed to this report. Copyright 2025 Nexstar Media, Inc. All rights reserved. This material may not be published, broadcast, rewritten, or redistributed.

Inside KPMG's $100 million AI investment: How Google Cloud's partnership is fueling the firm's new AI services
Inside KPMG's $100 million AI investment: How Google Cloud's partnership is fueling the firm's new AI services

Business Insider

time2 hours ago

  • Business Insider

Inside KPMG's $100 million AI investment: How Google Cloud's partnership is fueling the firm's new AI services

KPMG is a professional services company and one of the Big Four accounting firms in the US. It offers audit, tax, and advisory services to organizations in multiple sectors, including healthcare, finance, banking, and more. KPMG has more than 90 offices and 36,000 employees in the US. It also operates in more than 140 countries. Situation analysis Steve Chase, vice chair of artificial intelligence and digital innovation at KPMG, said part of the company's business involves helping organizations across industries modernize their operations with technology, including their accounting systems and customer service. Recently, Chase said more clients have sought assistance in incorporating artificial intelligence and cloud services into their digital transformation strategies. To help, KPMG announced an expansion of its partnership with Google Cloud in November to advance GenAI, data analytics, and cybersecurity for its clients. The expansion includes a $100 million investment in KPMG's Google Cloud practice. Chase said the goal is to tailor AI services to specific customers, business models, and industries so that these organizations can use AI to improve their businesses, such as by speeding up data analysis. The expanded Google Cloud partnership will initially focus on clients in the retail, healthcare, and financial services industries. Key staff and partners Chase said KPMG has been using AI for several years and has had a long-standing relationship with Google. In 2024, KPMG created the Google Cloud Center of Excellence to combine Google's AI technologies with its own expertise to help clients use AI to boost their businesses. Its latest partnership expansion involves creating new AI tools. KPMG also works with Microsoft, Amazon Web Services, and other tech companies on other AI-related projects. AI in Action KPMG has been using Google Cloud's Vertex AI Search, an AI development platform for building and using GenAI, internally to connect and analyze its vast amount of data. Chase said the company is using this information to develop GenAI agents for clients, such as chatbots to answer questions or tools to gather and analyze data, to address various business challenges and expand capabilities. For example, Chase said KPMG is using Vertex AI and Gemini, a Google Cloud AI-powered assistant, to help financial services companies automate tasks that have been cumbersome for humans, including fraud detection and loan applications. Chase added that KPMG also built an AI "store performance analyzer" for a large retailer. The tool allows the company to use automation to speed up and combine information from store locations, such as inventory levels, sales data, and details about the location, to determine how it performs compared to other stores. "It's able to actually do a detailed analysis in a fast way," which used to be completed by a team of people and take longer, Chase said. "Now, the people involved are actually reviewing the results, as opposed to doing all the manual work of pulling all the data together." For healthcare clients, KPMG is using Google Cloud's Healthcare API to develop AI tools that help doctors improve disease detection, treatment, and overall patient care. Did it work, and how did leaders know? Chase said that KPMG's partnership with Google Cloud could drive $1 billion incremental growth for the firm. "We've been super pleased with how it's going," he said. While he said the company couldn't disclose specifics on how it'll reach this figure, he said it will be a multi-year initiative that involves adding new clients and expanding the AI services it offers to existing companies. KPMG continues to roll out new AI initiatives. In April, the company announced another expansion of its collaboration with Google Cloud on AI tools for the legal and banking industries. KPMG also announced that it's joining the Google Cloud Security Partner Program to enhance cybersecurity for its clients.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store