Advanced AI models generate up to 50 times more CO₂ emissions than more common LLMs when answering the same questions

Yahoo, 19-06-2025
The more accurate we try to make AI models, the bigger their carbon footprint — with some prompts producing up to 50 times more carbon dioxide emissions than others, a new study has revealed.
Reasoning models, such as Anthropic's Claude, OpenAI's o3 and DeepSeek's R1, are specialized large language models (LLMs) that dedicate more time and computing power to produce more accurate responses than their predecessors.
Yet, aside from some impressive results, these models have been shown to face severe limitations in their ability to crack complex problems. Now, a team of researchers has highlighted another constraint on the models' performance — their exorbitant carbon footprint. They published their findings June 19 in the journal Frontiers in Communication.
"The environmental impact of questioning trained LLMs is strongly determined by their reasoning approach, with explicit reasoning processes significantly driving up energy consumption and carbon emissions," study first author Maximilian Dauner, a researcher at Hochschule München University of Applied Sciences in Germany, said in a statement. "We found that reasoning-enabled models produced up to 50 times more CO₂ emissions than concise response models."
To answer the prompts given to them, LLMs break up language into tokens, word chunks that are converted into a string of numbers before being fed into neural networks. These neural networks are tuned on training data to estimate the probabilities of certain patterns appearing. They then use these probabilities to generate responses.
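As a rough illustration of the "text to numbers" step, here is a minimal sketch in Python. It assumes a toy whitespace tokenizer; real LLMs typically use subword tokenizers with far larger vocabularies, and the function names `build_vocab` and `encode` are hypothetical, not taken from the study.

```python
def build_vocab(texts):
    """Assign each distinct lowercase word an integer ID, in order of first appearance."""
    vocab = {}
    for text in texts:
        for token in text.lower().split():
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


def encode(text, vocab, unk_id=-1):
    """Turn a string into a list of token IDs; unknown words map to unk_id."""
    return [vocab.get(token, unk_id) for token in text.lower().split()]


# Hypothetical example sentence, used only to build a tiny vocabulary.
vocab = build_vocab(["how much co2 does a prompt emit"])
token_ids = encode("How much does a prompt emit", vocab)
print(token_ids)
```

Every extra token a model generates is one more pass through the network, which is why token counts matter for energy use.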
Reasoning models further attempt to boost accuracy using a process known as "chain-of-thought." This is a technique that works by breaking down one complex problem into smaller, more digestible intermediary steps that follow a logical flow, mimicking how humans might arrive at the conclusion to the same problem.
However, these models have significantly higher energy demands than conventional LLMs, posing a potential economic bottleneck for companies and users wishing to deploy them. Yet, despite some research into the environmental impacts of growing AI adoption more generally, comparisons between the carbon footprints of different models remain relatively rare.
To examine the CO₂ emissions produced by different models, the scientists behind the new study asked 14 LLMs 1,000 questions across different topics. The different models had between 7 and 72 billion parameters.
The computations were performed using the Perun framework (which analyzes LLM performance and the energy it requires) on an NVIDIA A100 GPU. The team then converted energy usage into CO₂ by assuming each kilowatt-hour of energy produces 480 grams of CO₂.
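The conversion described above is a single multiplication. The sketch below uses the 480 g/kWh factor quoted in the article; the function names and the 0.25 kWh sample value are hypothetical and are not measurements from the study.

```python
GRAMS_CO2_PER_KWH = 480.0  # emission factor quoted in the article
JOULES_PER_KWH = 3.6e6     # 1 kWh = 3.6 million joules


def joules_to_kwh(energy_joules: float) -> float:
    """Convert an energy reading in joules to kilowatt-hours."""
    return energy_joules / JOULES_PER_KWH


def energy_to_co2_grams(energy_kwh: float, factor: float = GRAMS_CO2_PER_KWH) -> float:
    """Estimate grams of CO2 for a given energy use, assuming a fixed grid factor."""
    if energy_kwh < 0:
        raise ValueError("energy_kwh must be non-negative")
    return energy_kwh * factor


# Hypothetical reading: 0.25 kWh consumed while answering a batch of questions.
sample_kwh = 0.25
print(f"{sample_kwh} kWh -> {energy_to_co2_grams(sample_kwh)} g CO2")
```

Because the factor depends on the local energy grid, passing a different `factor` would change the estimate proportionally.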
Their results show that, on average, reasoning models generated 543.5 tokens per question compared to just 37.7 tokens for more concise models. These extra tokens — amounting to more computations — meant that the more accurate reasoning models produced more CO₂.
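To put those averages side by side, the short sketch below computes the token ratio and the totals over the 1,000 benchmark questions. It uses only the two averages reported above; the variable names are illustrative, and it does not assume that emissions scale exactly with token count.

```python
REASONING_TOKENS_PER_QUESTION = 543.5  # average reported in the study
CONCISE_TOKENS_PER_QUESTION = 37.7     # average reported in the study
NUM_QUESTIONS = 1_000                  # size of the benchmark set

# How many times more tokens a reasoning model generates per question.
token_ratio = REASONING_TOKENS_PER_QUESTION / CONCISE_TOKENS_PER_QUESTION

# Total tokens generated across the whole benchmark, per model type.
reasoning_total = REASONING_TOKENS_PER_QUESTION * NUM_QUESTIONS
concise_total = CONCISE_TOKENS_PER_QUESTION * NUM_QUESTIONS

print(f"Reasoning models generate about {token_ratio:.1f}x more tokens per question")
print(f"Over {NUM_QUESTIONS} questions: {reasoning_total:.0f} vs {concise_total:.0f} tokens")
```

The roughly 14-fold gap in tokens is an average across models, so it is not directly comparable to the "up to 50 times" emissions figure, which describes the extreme case.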
The most accurate model was the 72-billion-parameter Cogito model, which answered 84.9% of the benchmark questions correctly. Cogito produced three times the CO₂ emissions of similarly sized models made to generate answers more concisely.
"Currently, we see a clear accuracy-sustainability trade-off inherent in LLM technologies," said Dauner. "None of the models that kept emissions below 500 grams of CO₂ equivalent [total greenhouse gases released] achieved higher than 80% accuracy on answering the 1,000 questions correctly."
But the issues go beyond accuracy. Questions that needed longer reasoning, such as those in algebra or philosophy, drove emissions six times higher than straightforward look-up queries.
The researchers' calculations also show that the emissions depended on the models that were chosen. To answer 60,000 questions, DeepSeek's 70 billion parameter R1 model would produce the CO₂ emitted by a round-trip flight between New York and London. Alibaba Cloud's 72 billion parameter Qwen 2.5 model, however, would be able to answer these with similar accuracy rates for a third of the emissions.
The study's findings aren't definitive; emissions may vary depending on the hardware used and the energy grids used to supply their power, the researchers emphasized. But they should prompt AI users to think before they deploy the technology, the researchers noted.
"If users know the exact CO₂ cost of their AI-generated outputs, such as casually turning themselves into an action figure, they might be more selective and thoughtful about when and how they use these technologies," Dauner said.