Latest news with #webBrowsing

Microsoft's Edge just got a major AI makeover — meet Copilot Mode

Yahoo

2 days ago

Business
Yahoo

Microsoft's Edge just got a major AI makeover — meet Copilot Mode

When you buy through links on our articles, Future and its syndication partners may earn a commission. Microsoft is reimagining web browsing with Copilot Mode. Similar to OpenAI and Perplexity, this experimental new AI-powered mode in Edge understands your tabs, takes voice commands and even plans future tasks. Now available on Windows and Mac, Copilot Mode is completely free if you opt in. Smarter browsing With Copilot Mode turned on, Edge replaces your new‑tab page with a simplified layout centered around a single input box, combining search, chat and navigation. Once enabled, Copilot can access all your open tabs (with your permission) and use that context to answer questions or compare information without flipping between pages. For instance, if you're researching restuarant options across several tabs, you can now ask Copilot to identify the soonest availability, most affordable choice or closest location, and it takes care of everything for you. Voice and task-driven assistants Starting today, voice control is live. Users can now talk to Copilot and ask their queries that way instead of typing. Soon, Microsoft plans to allow Copilot to access browsing history and credentials (with your consent) to do things like book tickets or manage errands, truly acting on your behalf, which is similar to what ChatGPT Agent is currently doing. Copilot can now compare price options (like Google AI), offer suggestions and make reservations. However, approving payment details manually is still something users need to do (thankfully). Stay focused without losing your place Copilot slips into the sidebar or new tab, allowing you to check summaries, translate content, convert units or ask questions without losing access to the original page. For tab hoarders like me, I expect this to help with productivity as it keeps distractions down and work flow up. Designed with privacy and optionality in mind For those feeling skeptical about a broswer takeover, rest assured that Copilot Mode is fully optional. Users enable it manually and can disable it anytime. When active, Microsoft makes it clear whenever Copilot is listening, viewing your tabs or accessing data. All data is handled under Microsoft's privacy standards and only used with your explicit permission. And while usage limits apply, the feature is free for now. Microsoft hasn't yet confirmed if it will join a subscription tier later. Copilot-guided browsing journeys on the horizon Microsoft says forthcoming updates will let Copilot identify ongoing browsing themes and surface helpful suggestions and next steps. Whether you're planning a trip or researching a project, Copilot Mode promises to track the thread of your tasks. The goal of this new feature is to proactively help users stay productive while always offering clear visual cues and only if you opt in. Bottom line With AI-integrated browsers like Google AI, Comet, and others already in motion, Microsoft's upgrade places Edge back in the spotlight with other AI giants. If you're curious about how AI can change web browsing for planning, research or multitasking, this AI browser is worth a try. Copilot Mode trials are simple to enable, reversible and safe. And for the time it's free, it's worth seeing if AI-assisted browsing accelerates your workflow. More from Tom's Guide ChatGPT-5 launch expected soon — here's everything we know so far I tested ChatGPT Agent for a week — the good, the bad and the 'wait, it did what?' Here's why you shouldn't use ChatGPT as your therapist — according to Sam Altman

OpenAI's ChatGPT Agent Is Haunting My Browser

WIRED

22-07-2025

Business
WIRED

OpenAI's ChatGPT Agent Is Haunting My Browser

Jul 22, 2025 6:30 AM New tools from OpenAI and Perplexity can browse the web for you. If the idea takes off, these generative AI agents could turn the internet into a ghost town where only bots roam. Photo-Illustration:Most people's browser tabs are filled with unread news articles. Mine are filled with AI agents and ghost clicks. I have four instances of OpenAI's ChatGPT Agent—the generative AI tool released last week, which can run searches and perform tasks on the web—already open with each running in its own tab. I've given these first four agents relatively simple jobs based on ChatGPT's suggestions. One is clicking around to find a birthday gift on the Target website, and another is generating a pitch deck about robotic dogs. I open a fifth tab in order to try something more experimental: I want to see how good this ChatGPT Agent is at chess. After typing in some instructions, I watch as a ghostly cursor floats across my screen and the ChatGPT Agent goes to and plays an online opponent, all in a virtual browser. Things go south pretty quickly. The game's strategy isn't what trips up the AI tool, it's the act of moving the chess pieces that actually proves to be the most difficult. 'I'm focusing on accurate positioning as I continue playing despite earlier misclicks,' the agent says in its internal log before eventually quitting and letting me know that the controls were too difficult to navigate. Over the past few years, browser developers have integrated AI tools with middling success. Though, in recent weeks, the idea of a web browser enhanced by a baked-in generative AI chatbot has resurged with the release of OpenAI's ChatGPT Agent and Perplexity's Comet. The two releases are quite different in their execution. Comet is a stand-alone browser, so you can use it to surf the web and then summon the AI assistant to help write an email or complete a menial chore. OpenAI built its browsing tool inside of a chatbot; you talk to the chatbot through a web interface to give it tasks, and then the bot runs its own virtual browser inside your browser to complete them. Both releases can take control of cursors, enter text, and click on links. If this trend takes off, these kinds of AI-powered browsers could transform the internet into a ghost town where agents run amok and humans rarely venture. Tangled Web Despite the continued AI hype, my initial impression of OpenAI's ChatGPT Agent is that the glitchy feature currently seems like a proof of concept instead of a fully baked release. When executing the various tasks I gave it, the ChatGPT Agent often clicked wrong or fumbled through other errors. Additionally, its guardrails appeared inconsistent; while some explicit prompt requests, like asking it to fetch pornographic videos or 'find a dildo,' were denied by the agent, ChatGPT spent 18 minutes shopping for the perfect 'c-ring' on an X-rated website for adult toys: 'I've gathered details on 10 metal cock rings, including various prices and features.' I also couldn't help but wonder how this approach to browsing the internet might further hollow out the market for digital display ads, a business that's already struggling. My agents passed over ads for everything from rental cars to real estate investments. If you're not actively watching the agent click around in real time, you can watch replays afterward and see everything that appeared in the browser while the AI tool was in control, ads included. It makes sense that users would speed-scrub through a replay now, while the nascent feature is filled with errors. But if the accuracy rate for AI agents improves over time, then fewer people will feel the need to watch over their agent's shoulder, and fewer humans will be seeing those ads. At that point, it's hard to imagine advertisers sticking around. The more I watched replays of its actions, the more the agent gave me an unsettling, eerie feeling—not of being understood, but of being mimicked. It was like an obsessive robot stalker had watched humans through a window, meticulously taking notes about how they used the web in an effort to replicate their actions. It was able to do a hollow imitation of human behavior, but not able to grasp fully why individual decisions were being made. The skin of my arms filled with the kind of goosebumps you get hearing a human-like laugh while walking home alone late at night, looking around, and only seeing a lone crow perched high up on the telephone wire. Further leaning into the psuedo-humanness, the ChatGPT Agent is programmed to generate descriptions, from a first-person perspective, of each step on its journeys around the internet. While clicking, the simulation 'thinks' and sometimes gets 'confused.' As a whole, the ghostly agent is stuffed into an ill-fitting human suit. Running five OpenAI agents simultaneously in my browser quickly became overwhelming, and I couldn't actively track what each of them was up to. Yet, boosters of generative AI adoption and 'multi-agent orchestration' see this kind of approach as child's play. 'I'm excited by simulation tech where 20,000 AIs are all working alongside each other,' says Allie K. Miller, an AI-focused business consultant. Miller's approach to AI agents is more aligned with Silent Hill —and its fully haunted ghost town—than a small-scale haunting like The Conjuring . This grandiose vision of the agentic future upheld by AI proponents—thousands of phantom bots swarming the web at once, all at the direction of a single person—still feels a long way off. My artisanal quintet of agents struggled with the handful of tasks I gave them, even when the prompts were just the ones suggested by ChatGPT. The agent I sent off in search of a birthday gift clicked on the wrong thing multiple times, similar to the chess-playing agent that couldn't click on the right game piece. The agent generating a pitch deck took 26 minutes to gin up a presentation, and the results looked rushed, like something a struggling middle schooler would create the night before an assignment was due. Taking forever to generate mid results? Now that's what I call a spooky story.

Latest news with #webBrowsing

Microsoft's Edge just got a major AI makeover — meet Copilot Mode

OpenAI's ChatGPT Agent Is Haunting My Browser

Get Started Now: Download the App