Latest news with #GPT-5


Time of India
a day ago
- Business
- Time of India
AI will take away these two jobs: What Perplexity AI's CEO says about the future of work
The advancement of artificial intelligence (AI), especially in the past few years, has drastically changed the way we work and its future. The workplace is on the brink of a major transformation, and it's not just tech insiders saying this— it's coming directly from top industry leaders too. One such voice is Aravind Srinivas, CEO of Perplexity AI, who recently gave a striking outlook on how fast AI is reshaping the way we work. In a conversation on The Verge's Decoder podcast, Srinivas shared how their latest product, an AI-powered browser called Comet, is designed to take over common office tasks— specifically targeting roles like recruiters and executive assistants. In the podcast, the message of Perplexity AI's CEO was clear: Certain jobs will soon be taken over by AI. Recruiters could be replaced first Perplexity AI CEO Aravind Srinivas's Career Warning to Young Professionals Srinivas explained that many recruitment tasks are repetitive and so, they are perfectly suited for automation. From finding suitable candidates to sending out emails and tracking responses, these are processes that AI can handle with increasing efficiency. According to him, with more advanced models like GPT-5 or Claude 4.5 on the horizon, these functions could soon be managed entirely by AI systems like Comet. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Swelling and internal bleeding in the brain, help this baby Donate For Health Donate Now Undo Srinivas further shared that what usually takes a human recruiter a whole week could eventually be done in just a single AI prompt. Thus making this job obsolete in the near future. Executive assistants can also lose their jobs Srinivas further said that another job that can soon be replaced by AI is that of executive assistants as AI can be trained to manage important administrative tasks like calendar scheduling, email responses, meeting coordination and making notes, etc. He envisions a near future where AI proactively handles these duties— solving scheduling conflicts, providing meeting briefs, and tracking communications— without needing to be micromanaged. Essentially, AI like Comet could become a self-sufficient digital assistant that handles everything through simple conversations with its user. A wake-up call for young professionals Talking about the future of work and how AI is transforming lives, Srinivas also shared an important message for the workforce—especially young professionals. Speaking to Business Insider, he warned that AI is improving at lightning speed, with major advancements happening every few months. And so, he urged people to shift from mindless scrolling to skill-building, emphasising that those who don't adapt will be left behind. His advice? Spend less time on social media and more time learning how to use AI tools effectively. According to Srinivas, the future belongs to those who can harness AI, not those who ignore it. The next few years could be make-or-break for many in the workforce. AI most 'destructive force' in history, may take away all jobs: Elon Musk tells Rishi Sunak


Time of India
a day ago
- Business
- Time of India
Are you in these two professions? AI will replace you in office then, warns Perplexity CEO
Perplexity AI CEO Aravind Srinivas has revealed that two white-collar roles—recruiters and executive assistants—are likely to be among the first replaced by AI. Speaking on The Verge's podcast, he explained that the company's AI browser, Comet, is being designed to automate key administrative and organizational tasks. With the integration of more advanced reasoning models, Srinivas believes tools like Comet will soon handle everything from candidate outreach to calendar management with minimal human input. Tired of too many ads? Remove Ads Recruiters May Be the First to Go Executive Assistants Also in the Crosshairs Tired of too many ads? Remove Ads The Product and Its Future A Warning for the Workforce As artificial intelligence continues to evolve at a rapid pace, tech leaders are beginning to signal major changes ahead for the workplace. One such voice is Aravind Srinivas , CEO of Perplexity AI , who believes that AI-powered tools will soon take over key functions traditionally handled by white-collar a recent episode of The Verge's Decoder podcast, he outlined how the company's latest product, an AI browser named Comet, is designed to eventually automate the functions of two common office jobs: recruiters and executive explained that recruiting, at its core, involves a series of repetitive tasks that are well-suited to automation. With more advanced reasoning models—like upcoming versions of GPT-5 or Claude 4.5—he believes tools like Comet will be able to handle sourcing candidates, conducting outreach, tracking responses, and managing communication to Srinivas, tasks that typically take a human recruiter a week to complete could be reduced to a single AI-generated prompt. Comet is designed to interact with tools such as LinkedIn, Gmail, and Google Sheets to carry out candidate sourcing, send emails, update progress, and even schedule meetings—all without constant human recruiters, executive assistants are another role likely to be impacted. Comet is already capable of managing administrative responsibilities like calendar scheduling, email responses, and meeting coordination. Srinivas described a scenario where the AI tracks communication statuses, resolves scheduling conflicts, updates spreadsheets, and even prepares briefings before emphasized that the tool is being developed to function proactively, without the need for repeated instructions. The broader vision is for Comet to evolve into an autonomous assistant—a kind of operating system for knowledge workers—capable of executing complex tasks through natural language available by invitation to select premium users, Comet is positioned as a high-end tool for professionals. Srinivas pointed out that while access is limited, the value proposition is significant: if the AI can help generate millions in business output, users will see value in spending substantial sums for high-performing to Business Insider, Srinivas stressed that this level of automation is not only possible but imminent with more powerful AI models. He expects this transformation to unfold rapidly as reasoning capabilities a separate conversation with technology analyst Matthew Berman, Srinivas urged young professionals to shift their focus from passive digital consumption to skill development. He warned that those who fail to adapt to the accelerating pace of AI innovation risk falling highlighted that advancements in AI are now occurring every few months, and the ability to harness these tools effectively will define future success. Rather than spending hours on social media, Srinivas advised the next generation to build competencies that align with the new direction of the job market.


Hans India
a day ago
- Science
- Hans India
OpenAI's Experimental AI Matches Gold Medal Math Olympiad Performance, GPT-5 Launch Soon
In a remarkable step forward for artificial intelligence, an experimental large language model (LLM) from OpenAI has demonstrated gold medal-level performance at the 2025 International Math Olympiad (IMO). This milestone highlights how far AI's reasoning abilities have progressed, with OpenAI's CEO, Sam Altman, confirming that the highly anticipated GPT-5 will arrive soon. OpenAI researcher Alexander Wei announced the breakthrough on X, revealing that the experimental AI tackled five out of six problems from this year's IMO under authentic exam conditions. Scoring 35 out of 42 points, the model reached a level that would earn a human contestant a gold medal at the world's most challenging high school math competition. 'We evaluated our models on the 2025 IMO problems under the same rules as human contestants: two 4.5-hour exam sessions, no tools or internet, reading the official problem statements, and writing natural language proofs,' Wei stated. The IMO is widely recognised for its notoriously demanding problems that test the deepest levels of mathematical creativity and reasoning. Wei noted that the AI's performance marks a significant leap forward compared to previous benchmarks. 'We've now progressed from GSM8K (~0.1 min for top humans) MATH benchmark (~1 min) AIME (~10 mins) IMO (~100 mins),' he added, illustrating the scale of advancement. Independent grading by three former IMO medallists confirmed the model's solutions. The AI successfully solved problems P1 through P5 but did not complete P6. Wei made the model's detailed solutions public, pointing out its 'distinct style,' which reflects its experimental framework. What sets this achievement apart is the AI's ability to generate complex, human-like proofs. 'By going beyond the reinforcement learning paradigm of clear-cut, verifiable rewards we've obtained a model that can craft intricate, watertight arguments at the level of human mathematicians,' Wei explained. Despite this progress, OpenAI does not plan to release this specific IMO-level AI model to the public anytime soon. Wei clarified that while the company is gearing up for GPT-5's rollout, the Math Olympiad project remains separate and will continue behind closed doors. 'We don't plan to release a model with IMO gold level of capability for many months,' he added. Echoing Wei's excitement, OpenAI CEO Sam Altman called the accomplishment 'a significant marker of how far AI has come over the past decade.' He emphasised that the IMO-level AI is not a narrowly trained math tool but part of broader research pushing general-purpose reasoning forward. 'We are releasing GPT-5 soon, but want to set accurate expectations: this is an experimental model that incorporates new research techniques. We don't plan to release a model with IMO gold level of capability for many months,' Altman reiterated. Wei also looked back on his early forecasts, reflecting on how AI has surpassed expectations. 'In 2021, my PhD advisor, Jacob Steinhardt, had me forecast AI math progress by July 2025. I predicted 30 per cent on the MATH benchmark. Instead, we have IMO gold,' he wrote. He credited team members like Sheryl Hsu and Noam Brown for their contributions and congratulated this year's IMO competitors, noting that several OpenAI researchers were once IMO medallists themselves. With GPT-5 on the horizon and experimental AI solving Olympiad-level math, OpenAI's latest strides are set to reshape what's possible for machine reasoning in the years ahead.


India Today
a day ago
- Science
- India Today
Sam Altman says OpenAI LLM achieved IMO gold-level Math skills, GPT-5 launch coming soon
OpenAI model scored 35/42 in 2025 IMO mock test Evaluated under same conditions as human participants GPT-5 coming soon, but won't match IMO model's capabilities An experimental large language model (LLM) developed by OpenAI has achieved gold medal-level performance at the 2025 International Math Olympiad (IMO), setting a new benchmark in mathematical reasoning for AI systems. Announcing the milestone, OpenAI researcher Alexander Wei posted on X that the model solved five out of six problems from the latest IMO under human exam conditions. The model earned 35 out of 42 possible points, a score that would qualify for a gold medal at the real competition. 'We evaluated our models on the 2025 IMO problems under the same rules as human contestants: two 4.5 hour exam sessions, no tools or internet, reading the official problem statements, and writing natural language proofs,' Wei explained. The IMO is regarded as the most prestigious high school maths competition globally, known for its notoriously complex problems. Wei pointed out that such problems demand extended creative reasoning and that achieving gold-level performance represents a leap from earlier benchmarks. 'We've now progressed from GSM8K (~0.1 min for top humans) MATH benchmark (~1 min) AIME (~10 mins) IMO (~100 mins),' he said. Submissions were graded independently by three former IMO medallists, who unanimously validated the model's solutions. According to Wei, 'the model solved P1 through P5; it did not produce a solution for P6.' He shared the model's answers publicly, noting its 'distinct style,' owing to its experimental nature. Wei said what makes the result even more impressive is that IMO proofs are long, complex and hard to verify. "By going beyond the reinforcement learning paradigm of clear-cut, verifiable rewards we've obtained a model that can craft intricate, watertight arguments at the level of human mathematicians.' The LLM that achieved this result will not be released publicly any time soon. Wei clarified that while OpenAI is preparing to launch GPT-5, this IMO-level model is part of a different research track. 'We don't plan to release a model with IMO gold level of capability for many months.' OpenAI CEO Sam Altman echoed this in a follow-up post, calling the achievement 'a significant marker of how far AI has come over the past decade.' He clarified that this model is not a specialised maths system, but a general-purpose reasoning model. 'We are releasing GPT-5 soon but want to set accurate expectations: this is an experimental model that incorporates new research techniques we don't plan to release a model with IMO gold level of capability for many months,' Altman added. Looking back, Wei also reflected on how far AI progress has exceeded expectations. 'In 2021, my PhD advisor Jacob Steinhardt had me forecast AI math progress by July 2025. I predicted 30 per cent on the MATH benchmark Instead, we have IMO gold.' He credited collaborators including Sheryl Hsu and Noam Brown, and concluded by congratulating all 2025 IMO participants, noting that many OpenAI researchers are former IMO medallists themselves. An experimental large language model (LLM) developed by OpenAI has achieved gold medal-level performance at the 2025 International Math Olympiad (IMO), setting a new benchmark in mathematical reasoning for AI systems. Announcing the milestone, OpenAI researcher Alexander Wei posted on X that the model solved five out of six problems from the latest IMO under human exam conditions. The model earned 35 out of 42 possible points, a score that would qualify for a gold medal at the real competition. 'We evaluated our models on the 2025 IMO problems under the same rules as human contestants: two 4.5 hour exam sessions, no tools or internet, reading the official problem statements, and writing natural language proofs,' Wei explained. The IMO is regarded as the most prestigious high school maths competition globally, known for its notoriously complex problems. Wei pointed out that such problems demand extended creative reasoning and that achieving gold-level performance represents a leap from earlier benchmarks. 'We've now progressed from GSM8K (~0.1 min for top humans) MATH benchmark (~1 min) AIME (~10 mins) IMO (~100 mins),' he said. Submissions were graded independently by three former IMO medallists, who unanimously validated the model's solutions. According to Wei, 'the model solved P1 through P5; it did not produce a solution for P6.' He shared the model's answers publicly, noting its 'distinct style,' owing to its experimental nature. Wei said what makes the result even more impressive is that IMO proofs are long, complex and hard to verify. "By going beyond the reinforcement learning paradigm of clear-cut, verifiable rewards we've obtained a model that can craft intricate, watertight arguments at the level of human mathematicians.' The LLM that achieved this result will not be released publicly any time soon. Wei clarified that while OpenAI is preparing to launch GPT-5, this IMO-level model is part of a different research track. 'We don't plan to release a model with IMO gold level of capability for many months.' OpenAI CEO Sam Altman echoed this in a follow-up post, calling the achievement 'a significant marker of how far AI has come over the past decade.' He clarified that this model is not a specialised maths system, but a general-purpose reasoning model. 'We are releasing GPT-5 soon but want to set accurate expectations: this is an experimental model that incorporates new research techniques we don't plan to release a model with IMO gold level of capability for many months,' Altman added. Looking back, Wei also reflected on how far AI progress has exceeded expectations. 'In 2021, my PhD advisor Jacob Steinhardt had me forecast AI math progress by July 2025. I predicted 30 per cent on the MATH benchmark Instead, we have IMO gold.' He credited collaborators including Sheryl Hsu and Noam Brown, and concluded by congratulating all 2025 IMO participants, noting that many OpenAI researchers are former IMO medallists themselves. Join our WhatsApp Channel


Mint
2 days ago
- Business
- Mint
Mint Explainer: Is OpenAI exaggerating the powers of its new ChatGPT Agent?
Leslie D'Monte OpenAI has flagged the agent as high-risk under its safety framework. Is this just marketing hype or a sign that AI is genuinely becoming more powerful and autonomous? OpenAI CEO Sam Altman. Photo AFP Gift this article On Thursday, OpenAI launched its autonomous ChatGPT Agent, a tool that's capable of finding and buying things online, managing your calendar, and booking you an appointment with a doctor. It's essentially a digital assistant that doesn't just provide information but complete actual tasks. On Thursday, OpenAI launched its autonomous ChatGPT Agent, a tool that's capable of finding and buying things online, managing your calendar, and booking you an appointment with a doctor. It's essentially a digital assistant that doesn't just provide information but complete actual tasks. That being said, OpenAI has flagged the agent as high-risk under its safety framework, warning it could potentially be used to create dangerous biological or chemical substances. Is this just marketing hype, timed to build momentum for the launch of GPT-5, or a sign that AI agents are genuinely becoming more powerful and autonomous, akin to the agents who protect the computer-generated world of The Matrix? What is ChatGPT Agent? Say you want to rearrange your calendar, find a doctor and schedule an appointment, or research competitors and deliver a report. ChatGPT Agent can now do it for you. Also Read | Deep research with AI is days' worth of work in minutes The agent can browse websites, run code, analyse data, and even create slide decks or spreadsheets—all based on your instructions. It combines the strengths of OpenAI's earlier tools—operator (which could navigate the web) and deep research (which could analyse and summarise information)—into a single system. You stay in control throughout: ChatGPT asks for permission before doing anything important, and you can stop or take over at any time. This new capability is available to Pro, Plus, and Team users through the tools dropdown. How does it work? ChatGPT Auses a powerful set of tools to complete tasks, including a visual browser to interact with websites like a human, a text-based browser for reasoning-heavy searches, a terminal for code execution, and direct application programming interface (API) access. It can also connect to apps such as Gmail or GitHub to fetch relevant information. You can log in to websites within the agent's browser, allowing it to dig deeper into personalised content. All of this runs on its own virtual computer, which keeps track of context even across multiple tools. The agent can switch between browsers, download and edit files, and adapt its methods to complete tasks quickly and accurately. It's built for back-and-forth collaboration—you can step in anytime to guide or change the task, and ChatGPT can ask for more input when needed. If a task takes time, you'll get updates and a notification on your phone once it's done. Has OpenAI tested its performance? OpenAI said on Humanity's Last Exam (HLE), which tests expert-level reasoning across subjects, ChatGPT Agent achieved a new high score of 41.6, rising to 44.4 when multiple attempts were run in parallel and the most confident response was selected. On FrontierMath, the toughest known math benchmark, the agent scored 27.4% using tools such as a code-executing terminal—far ahead of previous models. In real-world tasks, ChatGPT agent performs at or above human levels in about half of the cases, based on OpenAI's internal evaluations. These tasks include building financial models, analysing competitors, and identifying suitable sites for green hydrogen projects. ChatGPT Agent also outperforms others on specialised tests such as DSBench for data science, and the SpreadsheetBench for spreadsheet editing (45.5% vs Copilot Excel's 20.0%). On BrowseComp and WebArena, which test browsing skills, the agent achieves the highest scores to date, according to OpenAI. What are some of the things it can do? Consider the case of travel planning. The agent won't just suggest ideas but navigate booking websites, fill out forms, and even make reservations one you give it permission. You can also ask it to read your emails, find meeting invitations, and automatically schedule appointments in your calendar, or even draft and send follow-up emails. This level of coordination typically required juggling between apps, but the agent manages it in a single conversational flow. Another example involves shopping and price comparison. You can tell the agent to 'order the best-reviewed smartphone under ₹ 15,000", and it can search online stores, compare prices and reviews, and proceed to checkout on a preferred platform. Customer support and task automation are other examples, where the agent is used to troubleshoot an issue, log into support portals, and even file return or refund requests. How are AI agents typically built? Unlike basic chat bots, AI agents are autonomous systems that can plan, reason, and complete complex, multi-step tasks with minimal input—such as coding, data analysis, or generating reports. They are built by combining ways to take in information, think, and take action. Developers begin by deciding what the agent should do, following which the agent collects data like such as or images from its environment. AI agents use large language models (LLMs) like GPT-4 as their core 'brain", which allows them to understand and respond to natural language instructions. To allow AI agents to take action, developers connect the LLM to things like a web browser, code editor, calculator, and APIs for services such as Gmail or Slack. Frameworks like LangChain help integrate these parts, and keep track of information. Some AI agents learn from experience and get better over time. Testing and careful setup make sure they work well and follow rules. Does ChatGPT Agent have credible competition? Google's Project Astra, part of its Gemini AI line, is developing a multimodal assistant that can see, hear, and respond in real time. Gemini CLI is an open-source AI agent that brings Google's Gemini model directly to the terminal for fast, lightweight access. It integrates with Gemini Code Assist, offering developers on all plans AI-powered coding in both VS Code and the command line. Microsoft is embedding Copilot into Windows, Office, and Teams, giving its agent access to workflows, system controls, and productivity tools, soon enhanced by a dedicated Copilot Runtime. Meta is building more socially focused agents within messaging and the metaverse, which could evolve into utility tools. Apple is revamping Siri through Apple Intelligence, combining GPT-level reasoning with strict privacy features and deep on-device integration. Other smart agents include Oracle's Miracle Agent, IBM's Watson tools, Agentforce from Salesforce Anthropic's Claude 3.5, and Perplexity AI's action-oriented agents through its Comet project, blending search with agentic behaviour. The competitive advantage, though, may go to companies that can integrate these AI agents into everyday applications and call for action with a single, unified tool – a task that ChatGPT Agent has demonstrated. Why did OpenAI warn that ChatGPT Agent could be used to trigger biological warfare? OpenAI claimed ChatGPT Agent's superior capabilities could, in theory, be misused to help someone create dangerous biological or chemical substances. However, it clarified that there was no solid evidence it could actually do so. Regardless, OpenAI is activating the highest level of safety measures under its internal 'preparedness framework'. These include thorough threat modeling to anticipate potential misuse, special training to ensure the model refuses harmful requests, and constant monitoring using automated systems that watch for risky behaviour. There are also clear procedures in place for suspicious activity. Should we take this risk seriously? Ja-Nae Duane, AI expert and MIT Research Fellow and co-author of SuperShifts, said the more autonomous the agent, the more permissions and access rights it would require. For example, buying a dress requires wallet access; scheduling an event requires calendar and contact list access. 'While standard ChatGPT already presents privacy risks, the risks from ChatGPT Agent are exponentially higher because people will be granting it access rights to external tools containing personal information (like calendar, email, wallet, and more). There's a significant gap between the pace of AI development and AI literacy; many people haven't even fully understood ChatGPT's existing privacy risks, and now they're being introduced to a feature with exponentially more risks," he said. Also Read | Google's Veo 3 brings the era of video on command Duane added that the key risks included data leaks, mistaken actions, prompt injection, and account compromise, especially when handling sensitive information. Malicious actors, he warned, could exploit them by manipulating inputs, abusing tool access, stealing credentials, or poisoning data to bias outputs. Poor third-party integration and an over-reliance of them could worsen the impact, while the agent's 'black box" nature would make it hard to trace errors, he added. In the wrong hands, these agents could be weaponised for fraud, phishing, or even to generate malware. What are the other concern areas for enterprises? Developers are increasingly deploying AI agents across IT, customer service, and enterprise workflows. According to Nasscom, 46% of Indian firms are experimenting with these agents, particularly in IT, HR, and finance, while manufacturing leads in robotics, quality control, and automation. Beyond concerns around hallucinations, security, privacy, and copyright or intellectual property (IP) violations, a key challenge for businesses is ensuring a return on investment. Gartner noted that many so-called agentic use cases could be handled by simpler tools and predicted that more than 40% of such projects would be scrapped by 2027 over high costs, unclear value, or inadequate risk controls. Of the thousands of vendors in this space, only around 130 are seen as credible; many engage in 'agent washing" by repackaging chatbots, robotic process automation (RPA), or basic assistants as autonomous agents. Nasscom corroborated these concerns, highlighting that 62% of enterprises were still only testing agents in-house. Why is 'humans-in-the-loop' a must? OpenAI CEO Sam Altman advised granting agents only the minimum access needed for each task, not blanket permissions. Nasscom believes that to scale responsibly, enterprises must prioritise human-AI collaboration, trust, and data readiness. It has recommended firms adopt AI agents with a 'human-in-the-loop" approach, reflecting the need for oversight and contextual judgment. According to Duane, users must understand both the tool's strengths and its limits, especially when handling sensitive data. Caution is key, as misuse could have serious consequences. She also emphasised the importance of AI literacy, noting that AI was evolving far faster than most people's understanding of how to use it responsibly. Also Read | Mint Primer: Are firms wasting their money on AI agents? Topics You May Be Interested In