Godfather of AI Alarmed as Advanced Systems Quickly Learning to Lie, Deceive, Blackmail and Hack

07-06-2025

A key artificial intelligence pioneer is concerned by the technology's growing propensity to lie and deceive — and he's founding his own nonprofit to curb such behavior.
In a blog post announcing LawZero, the new nonprofit venture, "AI godfather" Yoshua Bengio said that he has grown "deeply concerned" as AI models become ever more powerful and deceptive.
"This organization has been created in response to evidence that today's frontier AI models have growing dangerous capabilities and [behaviors]," the world's most-cited computer scientist wrote, "including deception, cheating, lying, hacking, self-preservation, and more generally, goal misalignment."
Of all people, Bengio would know. In 2018, the founder of the Montreal Institute for Learning Algorithms (MILA) was presented with a Turing Award alongside fellow AI pioneers Yann LeCun and Geoffrey Hinton for their formative roles in machine learning research, and he was listed as one of Time magazine's "100 Most Influential People" in 2024 thanks to his outsize impact on the ever-accelerating technology.
Despite the accolades, Bengio has repeatedly expressed regret over his role in bringing advanced AI technology — and its Silicon Valley hype cycle — to fruition. This latest missive seems to be his most stark to date.
"I'm deeply concerned," the AI pioneer wrote in his blog post, "by the behaviors that unrestrained agentic AI systems are already beginning to exhibit."
Bengio pointed to recent red-teaming experiments, or tests that push AI models to their limits to see how they'll act, showing that advanced systems have developed an uncanny tendency to keep themselves "alive" by any means necessary. Among his examples was a recent report from Anthropic detailing how its Claude 4 model, when told it would be shut down, threatened to blackmail an engineer with incriminating emails if they followed through.
"These incidents," the decorated researcher wrote, "are early warning signs of the kinds of unintended and potentially dangerous strategies AI may pursue if left unchecked."
To put such behavior in check, Bengio said that his new nonprofit is building a so-called "trustworthy" model, which he calls "Scientist AI," that is "trained to understand, explain and predict, like a selfless idealized and platonic scientist."
"Instead of an actor trained to imitate or please people (including sociopaths), imagine an AI that is trained like a psychologist — more generally a scientist — who tries to understand us, including what can harm us," he explained. "The psychologist can study a sociopath without acting like one."
A pre-peer-review paper Bengio and his colleagues published earlier this year explains it a bit more simply.
"This system is designed to explain the world from observations," the paper reads, "as opposed to taking actions in it to imitate or please humans."
The concept of building "safe" AI is far from new, of course — it's quite literally why several OpenAI researchers left OpenAI and founded Anthropic as a rival research lab.
This one seems to be different because, unlike Anthropic, OpenAI, or any other companies that pay lip service to AI safety while still bringing in gobs of cash, Bengio's is a nonprofit — though that hasn't stopped him from raising $30 million from the likes of ex-Google CEO Eric Schmidt, among others.
More on creepy AI: Advanced OpenAI Model Caught Sabotaging Code Intended to Shut It Down

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Yahoo

16 minutes ago

Yahoo

Meta Restructures AI Group, Again

Meta is restructuring its AI group, splitting it into four teams in a move to accelerate the company's pursuit of superintelligence. Bloomberg's Riley Griffin discusses the details with Caroline Hyde on "Bloomberg Tech." Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

Astera Labs Announces Third Quarter 2025 Financial Conference Participation

Yahoo

16 minutes ago

Yahoo

Astera Labs Announces Third Quarter 2025 Financial Conference Participation

SAN JOSE, Calif., Aug. 20, 2025 (GLOBE NEWSWIRE) -- Astera Labs, Inc. (Nasdaq: ALAB), a leader in semiconductor-based connectivity solutions for rack-scale AI infrastructure, today announced its participation in financial conferences for the third quarter 2025. Deutsche Bank 2025 Technology Conference on Aug. 28, 2025. Astera Labs' presentation is scheduled for 12:30 pm PT. Citi's 2025 Global TMT Conference on Sept. 4, 2025. Astera Labs' presentation is scheduled for 8:50 am ET. A webcast of each session will be made available on Astera Labs' investor relations website at About Astera Labs Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions grounded in open standards. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs' Intelligent Connectivity Platform integrates CXL®, Ethernet, PCIe®, and UALink™ semiconductor-based technologies with the company's COSMOS software suite to unify diverse components into cohesive, flexible systems that deliver end-to-end scale-up, and scale-out connectivity. Discover more at IR CONTACT: Leslie

Lam Research Corporation Announces Participation at Upcoming Conferences

Yahoo

16 minutes ago

Yahoo

Lam Research Corporation Announces Participation at Upcoming Conferences

FREMONT, Calif., Aug. 20, 2025 /PRNewswire/ -- Lam Research Corp. (Nasdaq: LRCX) today announced that Doug Bettinger, Executive Vice President and Chief Financial Officer, will participate in the following upcoming investor events: Citi 2025 Global TMT Conference on September 3, 2025, at 5:50 a.m. Pacific Daylight Time (8:50 a.m. Eastern Daylight Time) Goldman Sachs Communacopia + Technology Conference on September 10, 2025, at 8:10 a.m. Pacific Daylight Time (11:10 a.m. Eastern Daylight Time) Live audio webcast of these presentations will be available to the public and can be accessed from the Investors' section of Lam's website at A replay of the audio webcasts will be available for two weeks after the presentation date. About Lam Research Lam Research Corporation (NASDAQ: LRCX) is a global supplier of innovative wafer fabrication equipment and services to the semiconductor industry. Lam's equipment and services allow customers to build smaller and better performing devices. In fact, today, nearly every advanced chip is built with Lam technology. We combine superior systems engineering, technology leadership, and a strong values-based culture, with an unwavering commitment to our customers. Lam Research is a FORTUNE 500® company headquartered in Fremont, California, with operations around the globe. Learn more at (LRCX). IR Contact: Ram GaneshInvestor Relations(510) Source: Lam Research Corporation, (Nasdaq: LRCX) View original content to download multimedia: SOURCE Lam Research Corporation

Godfather of AI Alarmed as Advanced Systems Quickly Learning to Lie, Deceive, Blackmail and Hack

Hashtags

Try Our AI Features

Comments

Related Articles

Meta Restructures AI Group, Again

Astera Labs Announces Third Quarter 2025 Financial Conference Participation

Lam Research Corporation Announces Participation at Upcoming Conferences

Get Started Now: Download the App