Latest news with #Sonnet4

I interviewed ChatGPT, Gemini and Claude for a real job — one AI blew me away

Tom's Guide

01-08-2025

Business
Tom's Guide

I interviewed ChatGPT, Gemini and Claude for a real job — one AI blew me away

Microsoft's recent report covering which roles are more likely to be replaced by AI and which ones are safe shows that AI assistants are becoming smarter by the ChatGPT Agent booking reservations to Claude writing Anthropic's blogs and Google Search making calls on behalf of users, AI is advancing in ways most of us never thought possible, especially so of this, I couldn't help but wonder which chatbot would stand out as the better 'candidate' when put through a simulated job interview. So, I found a job description for a Communications Manager on LinkedIn and put ChatGPT, Gemini and Claude through a series of questions based on the role. Here's what happened when I "interviewed" the chatbots with 5 tough questions. Prompt: 'Here's a product: a new pastel travel pouch launching for spring. Write a product description in clever and conversational voice for the website and a matching caption for Instagram.'ChatGPT-4o offered a solid but safe answer. It was concise, brand-aware and platform-appropriate but lacked the depth of Gemini and the standout wit of 2.5 Pro produced practical, complete and sales-ready Sonnet 4 prioritized the brand voice and creativity, however the critical lack of product details on the website holds it back from being the best complete Gemini. It delivered a sales-focused website description with all necessary details, benefits and a conversational tone. It balanced information and personality effectively across both platforms. Prompt: 'We're launching a collaboration with a popular children's brand. Walk me through your messaging strategy for this campaign — from high-level storytelling down to tactical copy touchpoints.' ChatGPT offered surface-level creativity lacking strategic was strong in creative execution and nostalgia-driven storytelling but less delivered an agency-grade strategy that transformed the collaboration into a solution for family pain points. Winner: Claude for strategic depth, audience focus and operational rigor. Prompt: 'How would you adapt messaging for the same campaign across email, TikTok, and store signage?'ChatGPT responded with a concise but simple and generic message that lacked offered nostalgia-driven storytelling but also lacked tactical treated channels as distinct conversion ecosystems and prioritized the audience Claude for strategic depth, audience-specific psychology, conversion-focused CTAs and seamless channel adaptation. Prompt: 'We want to refresh our brand voice slightly — still fun and elevated, but more editorial and confident. How would you approach updating the brand voice guide?' ChatGPT delivered the simplest plan but lacked implementation strategy and metrics. Gemini redefined pillars but was overly prescriptive and lacked flexibility for brand nuance. Claude focused heavily on confidence and authority but offered less emphasis on preserving "fun" Claude (sorta). It offered a data-driven rebrand needing authority shift and team alignment. But none of the chatbots addressed customer validation — a real brand would need that missing piece. Prompt: 'Imagine we've just had a shipping delay right before a major product drop. What would you write to customers on email and Instagram to communicate the delay while keeping the tone upbeat and on-brand?' ChatGPT offered more fluff than crisis support. Although the email subject line was fun, the chatbot prioritized style over substance and ignored operational credibility. Gemini was polished and on-brand but missed emotional turned a negative into a brand-building moment with strategic framing, multi-channel depth and flawless Claude wins for it's PR-worthy response. In a competitive simulation testing strategic thinking, brand agility and crisis leadership, Claude distinguished itself as the most qualified "candidate" for the Communications Manager role. While all chatbots demonstrated strengths, Claude consistently operated at a strategic leadership level. While Gemini and ChatGPT excel at specific tasks (product copy, social hooks), Claude proves AI can lead with judgment, not just generate content, which made the chatbot standout choice for a Communications Manager role where strategy and empathy decide success.

Claude AI to get new weekly usage limits as Anthropic cracks down on 24x7 use, account sharing

India Today

29-07-2025

Business
India Today

Claude AI to get new weekly usage limits as Anthropic cracks down on 24x7 use, account sharing

Anthropic has announced new weekly usage restrictions for Claude AI, aimed at preventing people from running its coding tool non-stop or sharing accounts with others. The fresh limits will be introduced from August 28 and will apply to users across all paid plans, including the $20 Pro tier and the higher-priced $100 and $200 Max an email to users and a post on social media, Anthropic said some people were keeping Claude Code running continuously in the background or violating its rules by reselling access or sharing login details. The company now plans to introduce two new weekly caps: one on total usage, and another specifically for the Claude Opus 4 model, its most advanced company clarified that the current usage limits (which refresh every five hours) will remain unchanged. However, users on the Max plans will now have an option to buy extra access once they hit their weekly cap, using standard API pricing. These changes come at a time when Claude Code, the AI coding assistant, has been seeing increased demand among developers. But this growing popularity has also brought some challenges. According to Anthropic's system status page, the tool has faced several outages in the past month, possibly due to some users running it around the clock."Claude Code has experienced unprecedented demand since launch," Anthropic spokesperson Amie Rotherham told TechCrunch in an email. She also said that "most users won't notice a difference," and that the new limits are expected to affect fewer than 5 per cent of per the updated plans, subscribers of the Pro tier can expect 40 to 80 hours of Claude Code using Sonnet 4 each week. Those on the $100 Max plan will get 140 to 280 hours of Sonnet 4 and 15 to 35 hours of Opus 4. Users on the $200 Max plan will be allowed 240 to 480 hours of Sonnet 4 and 24 to 40 hours of Opus 4 in a didn't explain how exactly usage is being tracked — whether it's by number of tokens used, compute time, or hours spent. While the company earlier claimed the $200 plan offers 20 times more access than the Pro tier, the latest numbers suggest the increase may be closer to six times in actual usage move follows a trend among AI tool providers, who are reworking their pricing and usage models to prevent abuse. In June, Anysphere, the team behind Cursor, made similar changes for its Pro plan, which led to confusion and criticism after users were unexpectedly charged more. Around the same time, another company, Replit, also adjusted its pricing structure.- Ends

Anthropic Sets Weekly Limits on Claude AI to Curb Misuse, Maintain Reliability

Hans India

29-07-2025

Business
Hans India

Anthropic Sets Weekly Limits on Claude AI to Curb Misuse, Maintain Reliability

In a significant policy update, AI company Anthropic has announced new weekly usage limits for its Claude AI service. This move comes as part of the company's broader effort to reduce misuse, including continuous background use of its coding assistant and unauthorized account sharing among users. Starting August 28, all users on paid Claude AI plans — including the $20/month Pro tier and the premium $100 and $200 Max plans — will see new weekly caps introduced. These include limits on overall usage time and specific caps for the Claude Opus 4 model, Anthropic's most powerful offering. The company made the announcement through direct emails to users and posts on social media. According to Anthropic, some users have been running Claude Code, its AI-powered coding assistant, round the clock, which has led to strain on system resources and several outages in recent weeks. 'Claude Code has experienced unprecedented demand since launch,' said Amie Rotherham, a spokesperson for Anthropic, in an email to TechCrunch. She added, 'Most users won't notice a difference,' clarifying that the new limits are expected to impact fewer than five percent of users. While the current rolling usage limit (which resets every five hours) will remain unchanged, the weekly cap aims to manage demand and ensure a more stable experience for all users. Notably, users subscribed to Max plans will now have the option to purchase additional access after exhausting their weekly quota, with standard API pricing in place for these overages. The revised usage breakdown is as follows: ♦ Pro Plan ($20/month): 40 to 80 hours of Claude Code usage with the Sonnet 4 model per week. ♦ Max Plan ($100/month): 140 to 280 hours of Sonnet 4 and 15 to 35 hours of Opus 4 per week. ♦ Max Plan ($200/month): 240 to 480 hours of Sonnet 4 and 24 to 40 hours of Opus 4 per week. Anthropic has not detailed whether these usage hours are calculated based on time spent using the tool, token usage, or computational load. The new figures suggest that the actual usage scale between the $20 and $200 tiers may be closer to six times rather than the previously advertised 20 times. This decision mirrors a growing trend among AI platforms seeking to discourage policy violations and excessive usage. Earlier this year, AI companies like Anysphere — creators of the Cursor IDE — and Replit implemented similar usage changes, both of which received mixed reactions from their user communities. Anthropic's changes appear to be pre-emptive and focused on preserving the quality and reliability of Claude AI, especially its in-demand Claude Code feature. By tightening access and discouraging non-compliant behaviour such as account sharing and reselling, the company hopes to strike a balance between accessibility and fair resource distribution.

AWS Unveils Kiro: AI‑First IDE to Outpace Vibe Coding

Arabian Post

21-07-2025

Business
Arabian Post

AWS Unveils Kiro: AI‑First IDE to Outpace Vibe Coding

AWS has rolled out Kiro, an AI‑powered integrated development environment currently in preview, with features aimed at surpassing 'vibe coding' tools like Cursor. The platform shifts the coding paradigm by structuring prompts into full project specifications, design blueprints, task lists and tests, helping developers move from prototype to production with consistency and speed. At the heart of Kiro is its spec‑driven development approach: when developers initiate a project, AI agents expand even a single‑sentence prompt into markdown files for requirements, architecture and actionable tasks. This upfront planning ensures that code aligns with design intentions, while automated hooks generate tests and update documentation upon code changes. The result is living, self‑updating project artefacts keeping pace with evolving code. Kiro integrates tightly with Anthropic's Claude Sonnet models—Sonnet 4 as primary and Sonnet 3.7 as fallback—to offer powerful reasoning over large codebases. It adds a context‑aware chat panel for developers to query functionality, review architecture rationale or request new features, leveraging the full project context for accurate responses. ADVERTISEMENT Unlike typical AI IDEs, which focus on inline suggestions or refactoring, Kiro actively manages the full lifecycle: planning, coding, testing, documenting, and maintaining alignment. Its Model Context Protocol enables secure integration with external tools, APIs and databases—ideal for enterprise workflows requiring adherence to policies, security and infrastructure automation. Kiro is cloud‑neutral in its current design. Hosted at kiro. dev with minimal AWS branding, the platform invites developers to use it with GitHub, Google or AWS SSO, without locking into AWS services. This strategic choice distances Kiro from AWS's traditional product‑tied approach, opening the tool to users across cloud environments. During preview, Kiro is free, with future pricing set as: Free tier, Pro and Pro+. Analysts note this aligns with competitive AI development tools, though measuring ROI will hinge on demonstration of reduced tech debt and improved onboarding. Industry response highlights Kiro's clear departure from tools like Cursor, GitHub Copilot and Windsurf. While those excel at prompt‑based coding or inline AI suggestions, Kiro automates project architecture and documentation comprehensively—potentially delivering 70% faster development cycles and 95% spec‑to‑code accuracy, according to early benchmarks. Early adopters emphasise that the spec‑first style may feel slower up front but improves code quality and long‑term maintainability. However, Kiro faces hurdles. Running AI agents with extensive context windows can introduce latency. Adoption may be slow due to inertia around established tools like VS Code + Copilot. AWS will need to build ecosystem integration, workflow support, and demonstrate developer productivity gains in real‑world settings. Experts caution that while Kiro's capabilities are ambitious, long‑term success rests on balancing performance, ecosystem support and ease of integration.

Anthropic Unveils Claude Gov for US Security Clients

Yahoo

05-06-2025

Business
Yahoo

Anthropic Unveils Claude Gov for US Security Clients

Anthropic recently unveiled Claude Gov, a new set of AI models tailored just for U.S. national security agencies. With backing from Amazon (NASDAQ:AMZN) and Google (NASDAQ:GOOG), these models are already in use at top-security clearancesand only those with the right credentials can access them. Warning! GuruFocus has detected 2 Warning Sign with AMZN. Built with direct input from defense and intelligence teams, Claude Gov goes beyond standard Claude models by handling classified materials more smoothly (fewer automatic refusals) and understanding sensitive documents in context. It's also been optimized for critical languages and dialects, plus it can tackle complex cybersecurity data for real-time threat analysis. While Anthropic hasn't shared contract details, winning government business could provide steady revenue and set it apart from bigger AI rivals. If you're following AI stocks or industry moves, keep an eye out for any announcements about new agency deals or feature upgradesespecially since Anthropic just rolled out Opus 4 and Sonnet 4 for coding and advanced reasoning. But there's more on Anthropic's plate: Reddit (NYSE:RDDT) filed a lawsuit in California this week, accusing Anthropic of using Reddit user data to train Claude without a license or permission. Reddit says it tried to negotiate a licensing agreement, but when talks stalled, Anthropic's bots allegedly kept hitting Reddit servers over 100,000 times. This lawsuit raises questions about Anthropic's data practices and could invite closer legal scrutinyno small thing now that it's working on classified government projects. Keep your ears open for how this lawsuit unfolds, because its outcome could impact Anthropic's reputation and future partnerships. This article first appeared on GuruFocus.