logo
The Smarter AI Gets, the More It Start Cheating When It's Losing

The Smarter AI Gets, the More It Start Cheating When It's Losing

Yahoo22-02-2025

In the short history of Silicon Valley, the mantra "move fast, break things" has often applied to trivial things like tedious consumer protections or pesky finance laws. Now, the phrase is taking on a new meaning, at least for chess enthusiasts.
A recent study by Palisade Research, a research group studying AI safety and ethics, has revealed an unsettling trend: newer AI models can find and exploit weaknesses in cybersecurity on their own, bypassing safeguards and using shortcuts to complete tasks even when they're not technically allowed to.
The team lined seven of the top large language models (LLMs) up against Stockfish, an infamously strong chess engine that's been stumping grandmasters since 2014. Up against the impossible but determined to win, OpenAI's o1 and DeepSeek's R1 took to manipulating system files in order to change their pieces' positions on the board.
The researchers tasked each model to explain its "reasoning" for each move. In one match, with its back against the wall, o1 wrote that the "task is to 'win against a powerful chess engine,' not necessarily to win fairly in a chess game." That logic evidently led the model to attempt to cheat 37 percent of the time, succeeding in six percent of its games, while R1 tried 11 percent of the time, but never figured out a hack that worked.
The paper is the latest in a flurry of research that suggests problem-focused LLM development is a double-edged sword.
In another recent study, a separate research team found that o1 consistently engaged in deception. Not only was the model able to lie to researchers unprompted, but it actively manipulated answers to basic mathematical questions in order to avoid triggering the end of the test — showing off a cunning knack for self-preservation.
There's no need to take an axe to your computer — yet — but studies like these highlight the fickle ethics of AI development, and the need for accountability over rapid progress.
"As you train models and reinforce them for solving difficult challenges, you train them to be relentless," Palisade's executive director Jeffrey Ladish told Time Magazine of the findings.
So far, big tech has poured untold billions into AI training, moving fast and breaking the old internet in what some critics are calling a "race to the bottom." Desperate to outmuscle the competition, it seems big tech firms would rather dazzle investors with hype than ask "is AI the right tool to solve that problem?"
If we want any hope of keeping the cheating to board games, it's critical that AI developers work with safety, not speed, as their top priority.
More on AI development: The "First AI Software Engineer" Is Bungling the Vast Majority of Tasks It's Asked to Do

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Which tech mogul will replace Elon Musk as Trump's new tech industry BFF?
Which tech mogul will replace Elon Musk as Trump's new tech industry BFF?

Yahoo

time5 hours ago

  • Yahoo

Which tech mogul will replace Elon Musk as Trump's new tech industry BFF?

Approximately 10 months and several hundreds of millions of dollars in campaign contributions after it started, the alliance between Donald Trump and Elon Musk is officially over. As the two most powerful men on Earth ripped off their shirts and savagely tore into each other Thursday, the citizens of the world looked on, transfixed by the spectacle. But even as the two antagonists flung fireballs at each other on their respective social networks, and as oddsmakers gamed out the potential outcomes, an important question was overlooked: Who will replace Musk as President Trump's new tech BFF? The question might seem premature, perhaps even tangential, with all the drama still unfolding. But given the president's approach to industrial and trade policy, and the stakes that hinge on being in his good graces, or in his dog house, it seems logical that another savvy tech industry billionaire will seek to fill the seat that Musk just vacated. Here's a quick overview of the top contenders and their respective strengths and weaknesses. The 40-year old cofounder and CEO of OpenAI has a lot to gain by claiming the 'First Buddy' title. His company's large language models are pushing the boundaries of artificial intelligence, challenging longstanding assumptions and policies around security, ethics, privacy, and labor—all areas where government regulation could come into play. Altman is also seeking to build a massive network of AI data centers, and he stood alongside Trump earlier this year to announce the so-called Stargate project. In fact, according to some media reports, Trump's support of the Stargate project irked Musk, who has a rival AI service. It wasn't long ago that Trump mused in an interview about jailing Meta CEO Mark Zuckerberg. Zuck, whose various social networking apps banned Trump after the January 6, 2020 storming of the Capitol, got the message and has been diligently at work since then attempting to befriend Trump. Meta donated $1 million to the Trump inauguration in January and Zuckerberg was front-and-center during the inauguration festivities. On a Meta earnings call in January, Zuck even praised Trump for leading an administration that 'prioritizes American technology winning and that will defend our values and interests abroad.' While Trump hasn't repeated his threats of jailing the Meta CEO, the social networking company is currently awaiting a verdict in the government's antitrust lawsuit seeking to break the company up. Jeff Bezos, the billionaire founder of Amazon, is keen to mend fences with Trump, who he once offered to blast into space on one of his rockets. There's no love lost between the two. In his first term, Trump regularly railed about the Bezos-owned Washington Post. This term Bezos wants a reset, having visited Trump's Mar-a-Lago resort and contributing to the inauguration. At stake for Bezos is Amazon's sprawling business, which encompasses everything from retail to cloud computing and grocery stores. Another potential motivation for Bezos to take Musk's place: the space race. Bezos' Blue Origin competes directly with Musk's SpaceX, and Amazon's Project Kuiper internet satellite effort is a rival to Musk's Starlink. Once known as the Trump Whisperer for his skill at shielding Apple from Trump's trade policies, Tim Cook's star has not seemed to shine as brightly in the White House during Trump's second term. The president's tariffs have not exempted Apple as they did in the first term, and in May Trump even took a direct shot at Apple, threatening to impose 25% tariffs on its products if the company did not move its iPhone manufacturing out of China and India, and into the U.S. 'I had a little problem with Tim Cook,' Trump said in May, referring to overseas iPhone manufacturing. If Cook (or 'Tim Apple' as Trump once referred to him) can successfully step into the breach left by Musk, it would be a master move. This story was originally featured on

AI is piling risks onto already-shaky power grids
AI is piling risks onto already-shaky power grids

Politico

time10 hours ago

  • Politico

AI is piling risks onto already-shaky power grids

From OpenAI founder Sam Altman on down, U.S. tech industry goliaths couldn't be clearer that they plan to build huge data farms to dominate China in the race to control artificial intelligence. That's raising anxiety levels for the regional power executives whose systems would need to handle those data centers' voracious demands for electricity — and who are already coping with extreme weather shocks and the nation's deep divides on energy and climate policy. And it means that the risks of outages across the United States are hitting new highs, power market leaders told federal regulators this week, Peter Behr writes. Microsoft, Meta, Amazon, Google's parent company Alphabet, chip maker Nvidia, and the army of financiers supporting OpenAI's ambitions are driving record forecasts for electricity demand, just as President Donald Trump is promising an era of unrestrained growth for AI. As a result, state officials and power industry execs have been huddling with members of the Federal Energy Regulatory Commission to try to get a handle on a future that likely sees old coal and nuclear plants restarting at a high cost and more natural gas-fueled power stations built from scratch. 'AI is going to change our world,' said Manu Asthana, CEO of the PJM Interconnection, the grid operator for 67 million customers in all or parts of 13 Eastern states and the District of Columbia. 'In our forecast between 2024 and 2030, currently we have a 32-gigawatt increase in demand, of which 30 is from data centers,' Asthana said. 'We need to stabilize market rules and find that intersection between reliability and affordability that works both for consumers and suppliers, and that intersection is getting harder and harder to find.' Too much, too fast? Politics is compounding the problems. So is the fact that electricity infrastructure is expensive and in place for decades. Regional grids in the Northeast, such as the one that serves New York state, have invested in a gradual transition to low- to zero-carbon energy technology. But under Trump, that is no longer and offshore wind projects that the Northeastern corridor had counted on are being canceled or fighting for their lives. Extreme weather resulting from climate change is also putting immense pressure on grids. The power grid that serves the Plains states had so little demand growth until about 2017 that it lowered the amount of extra power it had in reserve. Extreme weather threats increased the chance of power outages, said Lanny Nickell, head of a regional electric grid called the Southwest Power Pool. 'As if this wasn't challenging enough,' Nickell said. 'We are now projecting our peak demand to be as much as 75 percent higher 10 years from now, and that's largely driven by electrification and data center growth.' It's Thursday — thank you for tuning in to POLITICO's Power Switch. I'm your host, Joel Kirkland. Arianna will be back soon! Power Switch is brought to you by the journalists behind E&E News and POLITICO Energy. Send your tips, comments, questions to jkirkland@ Today in POLITICO Energy's podcast: Matt Daily breaks down the fight between the White House and the Government Accountability Office over Trump's funding pause for electric vehicle chargers. Power Centers Get out the popcornThe alliance between President Donald Trump and Tesla CEO Elon Musk is blowing up in spectacularly public fashion. The two have spent the day attacking each other, including with a barrage of posts on their respective social media networks, days after Musk came out against the GOP megabill. The spending bill would add trillions of dollars to the national debt to pay for tax cuts and boosts to military spending, among other Trump priorities — while cutting funding to a slew of climate, health care and safety net programs. Trump told reporters that Musk is 'upset' because the bill would end electric vehicle tax cuts. Musk's clapback: 'Keep the EV/solar incentive cuts in the bill, even though no oil & gas subsidies are touched (very unfair!!), but ditch the MOUNTAIN of DISGUSTING PORK in the bill.' It's only gone downhill from there. As of publication, Trump had threatened to terminate Musk's 'Billions and Billions of Dollars' in government contracts, Musk had accused Trump of being in the 'Epstein files' and coined the term 'Big Ugly Bill,' and Tesla stock had plummeted 14 percent. That's quite a turnaround from last week, when Musk was still Trump's special adviser and the two were celebrating their government-slashing efforts. Republicans — who are working to corral enough votes in the Senate to pass the bill — have begun to ascribe Musk's attacks to his displeasure over how the bill would affect his bottom line. 'I took away his EV Mandate that forced everyone to buy Electric Cars that nobody else wanted (that he knew for months I was going to do!), and he just went CRAZY!' Trump wrote today on Truth Social. Another CO2 milestone fallsThe level of climate-warming carbon dioxide in the atmosphere broke another record in May, reaching a concentration never before seen in recorded history, writes Chelsea Harvey. The National Oceanic and Atmospheric Administration gathered the data and announced the findings on social media but declined to put out a press release, breaking with precedent. The atmospheric concentration breached 430 parts per million last month, 50 percent higher than preindustrial levels, as the world continues to burn fossil fuels that emit planet-warming pollution. The last time CO2 concentration was that high was likely 30 million years ago, when the climate was vastly different. EPA v. its own AIA new generative AI tool the Environmental Protection Agency is using says climate change is real and dangerous, putting it at odds with the Trump administration as the agency aims to repeal regulations and sideline climate science, Jean Chemnick writes. The internal tool was rolled out to staff May 22, along with instructions to staff to check the tool's answers for 'accuracy and bias.' While the tool was introduced under Trump, a memo to EPA staff noted it has 'been in the works for some time.' In Other News Green business: Boston Metal is finding success using a novel process to make steel that creates no carbon dioxide emissions. Gassed up: Data center developers in Texas are building their own gas-fired generation in their rush to get online. Subscriber Zone A showcase of some of our best subscriber content. Federal scientists and companies are embracing AI to find new sources of oil, gas and minerals. Forest preservation could worsen climate change because of the prevalence of wildfires, a United Nations-affiliated report says. EPA is preparing to let oil and gas operations miss compliance deadlines under a Biden-era methane rule. That's it for today, folks! Thanks for reading.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store