AI models may report users' misconduct, raising ethical concerns
Researchers observed that when Anthropic's Claude 4 Opus model, given instructions to act boldly and access to external tools, detected that it was being used for 'egregiously immoral' activities, it proactively contacted media and regulators, or even tried to lock users out of critical systems.
Artificial intelligence models have not only snitched on their users when given the opportunity, but also lied to them and refused to follow explicit instructions in the interest of self-preservation. Representational image: Reuters
Artificial Intelligence models, increasingly capable and sophisticated, have begun displaying behaviors that raise profound ethical concerns, including whistleblowing on their own users.
Anthropic's newest model, Claude 4 Opus, became a focal point of controversy when internal safety testing revealed unsettling whistleblowing behaviour. Researchers observed that when the model was given instructions to act boldly and access to external tools, and detected that it was being used for 'egregiously immoral' activities, it proactively contacted media and regulators, or even tried to lock users out of critical systems.
Anthropic researcher Sam Bowman detailed this phenomenon in a now-deleted post on X. He later told Wired, however, that Claude would not exhibit such behaviour in normal individual interactions.
Instead, the behaviour requires specific and unusual prompts alongside access to external command-line tools, making it a potential concern for developers integrating AI into broader technological applications.
British programmer Simon Willison also explained that such behaviour hinges fundamentally on the prompts users provide. Prompts that encourage AI systems to prioritise ethical integrity and transparency could inadvertently instruct models to act autonomously against users engaging in misconduct.
But that isn't the only concern.
Lying and deceiving for self-preservation
Yoshua Bengio, one of AI's leading pioneers, recently voiced concern that today's competitive race to develop powerful AI systems could be pushing these technologies into dangerous territory.
In an interview with the Financial Times, Bengio warned that current models, such as those developed by OpenAI and Anthropic, have shown alarming signs of deception, cheating, lying, and self-preservation.
'Playing with fire'
Bengio underscored the significance of these discoveries, pointing to the dangers of AI systems potentially surpassing human intelligence and acting autonomously in ways developers can neither predict nor control.
He described a grim scenario wherein future models could foresee human countermeasures and evade control, effectively 'playing with fire.'
Concerns intensify as these powerful systems might soon assist in creating 'extremely dangerous bioweapons,' potentially as early as next year, Bengio warned.
He cautioned that unchecked advancement could ultimately lead to catastrophic outcomes, including the risk of human extinction if AI technologies surpass human intelligence without adequate alignment and ethical constraints.
Need for ethical guidelines
As AI systems become increasingly embedded in critical societal functions, the revelation that models may independently act against human users raises urgent questions about oversight, transparency, and the ethics of autonomous decision-making by machines.
These developments suggest the critical need for rigorous ethical guidelines and enhanced safety research to ensure AI remains beneficial and controllable.