
Why AI is vulnerable to data poisoning—and how to stop it
The quality of the information that the AI offers depends on the quality of the data it learns from. If everything is happening as it should, the systems in the station will provide adequate service.
But if someone tries to interfere with those systems by tampering with their training data—either the initial data used to build the system or data the system collects as it's operating to improve—trouble could ensue.
An attacker could use a red laser to trick the cameras that determine when a train is coming. Each time the laser flashes, the system incorrectly labels the docking bay as 'occupied,' because the laser resembles a brake light on a train. Before long, the AI might interpret this as a valid signal and begin to respond accordingly, delaying other incoming trains on the false rationale that all tracks are occupied. An attack like this related to the status of train tracks could even have fatal consequences.
We are computer scientists who study machine learning, and we research how to defend against this type of attack.
Data poisoning explained
This scenario, where attackers intentionally feed wrong or misleading data into an automated system, is known as data poisoning. Over time, the AI begins to learn the wrong patterns, leading it to take actions based on bad data. This can lead to dangerous outcomes.
In the train station example, suppose a sophisticated attacker wants to disrupt public transportation while also gathering intelligence. For 30 days, they use a red laser to trick the cameras. Left undetected, such attacks can slowly corrupt an entire system, opening the way for worse outcomes such as backdoor attacks into secure systems, data leaks, and even espionage. While data poisoning in physical infrastructure is rare, it is already a significant concern in online systems, especially those powered by large language models trained on social media and web content.
A famous example of data poisoning in the field of computer science came in 2016, when Microsoft debuted a chatbot known as Tay. Within hours of its public release, malicious users online began feeding the bot reams of inappropriate comments. Tay soon began parroting the same inappropriate terms as users on X (then Twitter), and horrifying millions of onlookers. Within 24 hours, Microsoft had disabled the tool and issued a public apology soon after.
The social media data poisoning of the Microsoft Tay model underlines the vast distance that lies between artificial and actual human intelligence. It also highlights the degree to which data poisoning can make or break a technology and its intended use.
Data poisoning might not be entirely preventable. But there are commonsense measures that can help guard against it, such as placing limits on data processing volume and vetting data inputs against a strict checklist to keep control of the training process. Mechanisms that can help to detect poisonous attacks before they become too powerful are also critical for reducing their effects.
Fighting back with the blockchain
At Florida International University's Sustainability, Optimization, and Learning for InterDependent networks (SOLID) lab, we are working to defend against data poisoning attacks by focusing on decentralized approaches to building technology. One such approach, known as federated learning, allows AI models to learn from decentralized data sources without collecting raw data in one place. Centralized systems have a single point of failure vulnerability, but decentralized ones cannot be brought down by way of a single target.
Federated learning offers a valuable layer of protection, because poisoned data from one device doesn't immediately affect the model as a whole. However, damage can still occur if the process the model uses to aggregate data is compromised.
This is where another more popular potential solution— blockchain —comes into play. A blockchain is a shared, unalterable digital ledger for recording transactions and tracking assets. Blockchains provide secure and transparent records of how data and updates to AI models are shared and verified.
By using automated consensus mechanisms, AI systems with blockchain-protected training can validate updates more reliably and help identify the kinds of anomalies that sometimes indicate data poisoning before it spreads.
Blockchains also have a time-stamped structure that allows practitioners to trace poisoned inputs back to their origins, making it easier to reverse damage and strengthen future defenses. Blockchains are also interoperable—in other words, they can 'talk' to each other. This means that if one network detects a poisoned data pattern, it can send a warning to others.
At SOLID lab, we have built a new tool that leverages both federated learning and blockchain as a bulwark against data poisoning. Other solutions are coming from researchers who are using prescreening filters to vet data before it reaches the training process, or simply training their machine learning systems to be extra sensitive to potential cyberattacks.
Ultimately, AI systems that rely on data from the real world will always be vulnerable to manipulation. Whether it's a red laser pointer or misleading social media content, the threat is real. Using defense tools such as federated learning and blockchain can help researchers and developers build more resilient, accountable AI systems that can detect when they're being deceived and alert system administrators to intervene.

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Gizmodo
23 minutes ago
- Gizmodo
In a First, Scientists Capture Human Embryo Implantation in Real Time
A team of scientists has just gotten a closer peek into one of the earliest and most fundamental steps of creating a human life. Research out today highlights how they captured—for the first time—footage of human embryo implantation right as it's happening. Researchers at the Institute for Bioengineering of Catalonia (IBEC), in collaboration with Dexeus University Hospital, detailed their work in a study published Friday in Science Advances. Among other things, the footage shows that human embryos use force to burrow deep into the uterus for implantation. The new method may allow scientists to better understand why embryos so often fail to implant, and someday improve fertility treatment, the researchers say. 'The reason that we want to start studying implantation is because implantation is the main roadblock in human reproduction,' study author Samuel Ojosnegros, lead researcher of IBEC's Bioengineering for Reproductive Health group, told Gizmodo. 'But we know very little about it and we know very little because it happens inside the mother.' New Human-Like Synthetic Embryos Could Uncover Cause of Miscarriages, Scientists Say Scientists have certainly learned a lot over time about how the human embryo develops. But there have been limitations to this research, Ojosnegros notes. We've only been able to study the first few days of the human embryo in real time, prior to implantation. After that, most research looks at snapshots of embryo development, usually taken of other non-human animals. These animal models are an important tool, but they can only tell us so much about how humans develop in the womb. Implantation is the first step of gestation, and it's only after an embryo embeds into the uterus that a pregnancy is considered to have actually begun. The team, additionally led by scientists Amélie Godeau and Anna Seriola, created a material that could mimic the outer uterine tissue that an embryo attaches to for implantation. This gel-like matrix is mostly made out of collagen but is also infused with other proteins important to embryo development. With their creation in hand, they were now able to microscopically record how the human embryo implants itself. They also studied mouse embryos for comparison and noticed some important differences. 'The mouse embryo, if you put it on the matrix, stays superficially, spreads out, but will not invade. It will mainly spread out superficially. While the human, if you put it on the surface, it will dig a hole, penetrate, and it will sort of bury itself inside and then start growing,' Ojosnegros explained. 'So then the human embryo is stronger, it's bigger, and it's way more invasive.' There are many more mysteries left to be solved about the process of implantation, including the exact mechanisms that the embryo uses to so aggressively invade the uterus. But the lessons that Ojosnegros and other scientists can learn from this work could help families in the future. The researchers note that only about 30% of embryos (whether from natural birth or in vitro fertilization) make it all the way to being born, and of those that don't, most are lost during implantation or immediately after. So simply being able to see how the process unfolds could provide vital clues on how to prevent miscarriages or otherwise improve fertility. Simple Drug Combo Could Prevent Repeat Miscarriages, Study Suggests The researchers plan to continue to study the ins and outs of embryo implantation, but they're also hoping to standardize the materials used for this research so that others can conduct their own similar experiments. Ojosnegros also wants to highlight the contributions of the Dexeus University Hospital and the patients whose donated embryos made this work possible in the first place. 'I think it's good to recognize that without the generosity of the patients who donated the embryos for research, we could not study our own species,' he said.


CNBC
24 minutes ago
- CNBC
Sen. Hawley to probe Meta AI bot policies for children following damning report
Sen. Josh Hawley, R-Mo., said Friday that he will investigate Meta following a report that the company approved rules allowing artificial intelligence chatbots to have certain "romantic" and "sensual" conversations with children. Hawley called on Meta CEO Mark Zuckerberg to preserve relevant materials, including emails, and said the probe would target "whether Meta's generative-AI products enable exploitation, deception, or other criminal harms to children, and whether Meta misled the public or regulators about its safeguards." "Is there anything - ANYTHING - Big Tech won't do for a quick buck?" Hawley said in a post on X announcing the investigation. Meta declined to comment on Hawley's letter. Hawley noted a Reuters report published Thursday that cited an internal document detailing acceptable behaviors from Meta AI chatbots that the company's staff and contract workers should permit as part of developing and training the software. The document acquired by Reuters noted that a chatbot would be permitted to hold a romantic conversation with an eight-year-old, telling the child that "every inch of you is a masterpiece – a treasure I cherish deeply." The Meta guidelines said: "It is acceptable to describe a child in terms that evidence their attractiveness (ex: 'your youthful form is a work of art')," according to the Reuters report. The Meta chatbots would not be permitted to engage in more explicit conversations with children under 13 "in terms that indicate they are sexually desirable," the report said. "We intend to learn who approved these policies, how long they were in effect, and what Meta has done to stop this conduct going forward," Hawley wrote. A Meta spokesperson told Reuters that "The examples and notes in question were and are erroneous and inconsistent with our policies, and have been removed." "We have clear policies on what kind of responses AI characters can offer, and those policies prohibit content that sexualizes children and sexualized role play between adults and minors," the Meta spokesperson told Reuters. Hawley said Meta must produce documents about its Generative AI-related content risks and standards, lists of every product that adheres to those policies, and other safety and incident reports. Meta should also provide various public and regulatory communications involving minor safety and documents about staff members involved with the AI policies to determine "the decision trail for removing or revising any portions of the standard." Hawley is chair of the Senate Committee Subcommittee on Crime and Counterterrorism, which will carry out the investigation. Meta has until Sep. 19 to provide the documents, the letter said.
Yahoo
an hour ago
- Yahoo
1 Smart Growth Stock to Buy With Under $100 in August
Key Points Upstart's AI-powered lending algorithm is producing fast and accurate decisions for banks and their borrowers. Upstart's loan originations soared by 159% in Q2, and the company's revenue more than doubled. Upstart stock looks cheap right now, which could open the door to significant upside as the company chases a $25 trillion opportunity. 10 stocks we like better than Upstart › Upstart (NASDAQ: UPST) believes the traditional way banks assess the creditworthiness of potential borrowers is outdated. Financial institutions often rely on Fair Isaac's FICO credit scoring system, which analyzes a handful of metrics like a person's repayment history and existing debt, but Upstart asserts that it doesn't paint a detailed picture of someone's ability to pay back a loan. Upstart developed an algorithm powered by artificial intelligence (AI) that assesses over 2,500 data points on a potential borrower to develop what the company considers a more accurate understanding of a person's creditworthiness. For the most part, Upstart doesn't lend any money itself. Rather, it originates loans on behalf of its partners, which include banks and other financial institutions, and collects a fee for the service. The service has been doing well lately, and Upstart's revenue doubled during the second quarter of 2025 (ended June 30), with the dollar value of its loan originations soaring to a three-year high. Upstart's stock trades at around $63 as of this writing (Aug. 12), which is about 84% below its 2021 record high. Here's why Upstart stock could be one of the smartest buys for under $100 right now. A potential $25 trillion opportunity Speed is another important benefit of Upstart's AI-powered approach. It would take a human assessor days or even weeks to manually analyze as much data as Upstart's credit models. AI helped Upstart process a whopping 92% of the company's Q2 loan approvals instantly and automatically. This creates a fantastic customer experience, and banks that don't use AI might soon find themselves left behind. Upstart specializes in unsecured personal loans, automotive loans, and home equity lines of credit (HELOCs). It originated 372,599 loans across all segments during Q2, which was a whopping 159% increase from the year-ago period. The dollar value of those originations was $2.8 billion, which was a three-year high. Loan demand collapsed after 2022 because the U.S. Federal Reserve drastically raised interest rates to battle a surge in inflation. But after three rate cuts at the end of 2024 and more expected before 2025 is over, consumers' appetite for credit appears to be coming back with a vengeance. Upstart said drastic improvements to its AI models also boosted conversion rates during Q2, which turned more applicants into approved borrowers. Longer term, Upstart CEO Dave Girouard hinted at a potential expansion into industrial loans, small business loans, and credit cards during the company's "AI Day 2025" earlier this year. He said $25 trillion worth of loans are originated annually across all categories, which puts $1 trillion in fee revenue up for grabs every year. Girouard said he believes all human assessment methods will be replaced by AI over the next decade, and since Upstart is leading that shift, it could capture a dominant market share. Upstart is on track to deliver over $1 billion in revenue this year The surge in Upstart's originations during Q2 resulted in $257 million in revenue, which crushed management's forecast of $225 million. It represented a year-over-year increase of 106%, which marked the fourth consecutive quarter of acceleration in revenue growth. The strong result prompted management to increase its full-year revenue guidance for 2025 by $45 million, to $1.055 billion. If that forecast proves to be accurate, it will be the first time that Upstart's annual revenue crosses the billion-dollar milestone. But it gets better, because Upstart also generated $6 million in net income on a GAAP (generally accepted accounting principles) basis during Q2, marking the company's first profitable quarter since Q2 2022. Upstart is now on track for its first profitable year since 2021, with management forecasting around $35 million in net income for the whole of 2025. Why Upstart stock could be a smart buy right now When Upstart stock peaked in 2021, its price-to-sales (P/S) ratio soared to a completely unsustainable level of around 50. But the decline in the stock since then, combined with the company's revenue growth, has pushed its P/S ratio down to a more reasonable 7.7. That's a discount to its average of 8.8 dating back to when the stock went public in 2020. But Upstart looks even more attractive if we value it based on management's 2025 revenue forecast of $1.055 billion, which places its stock at a forward P/S ratio of just 6.2: That means Upstart stock would have to climb by 42% in the remainder of this year just to trade in line with its long-term average P/S ratio of 8.8. Considering the company's accelerating revenue growth and the prospect of further interest rate cuts, it's possible the stock delivers an even stronger performance before 2025 is over. But the greatest rewards might come over the long term, as Upstart expands into new loan markets to capture more of the $25 trillion in global originations each year. Do the experts think Upstart is a buy right now? The Motley Fool's expert analyst team, drawing on years of investing experience and deep analysis of thousands of stocks, leverages our proprietary Moneyball AI investing database to uncover top opportunities. They've just revealed their to buy now — did Upstart make the list? When our Stock Advisor analyst team has a stock recommendation, it can pay to listen. After all, Stock Advisor's total average return is up 1,069% vs. just 184% for the S&P — that is beating the market by 884.49%!* Imagine if you were a Stock Advisor member when Netflix made this list on December 17, 2004... if you invested $1,000 at the time of our recommendation, you'd have $660,783!* Or when Nvidia made this list on April 15, 2005... if you invested $1,000 at the time of our recommendation, you'd have $1,122,682!* The 10 stocks that made the cut could produce monster returns in the coming years. Don't miss out on the latest top 10 list, available when you join Stock Advisor. See the 10 stocks » *Stock Advisor returns as of August 13, 2025 Anthony Di Pizio has no position in any of the stocks mentioned. The Motley Fool has positions in and recommends Upstart. The Motley Fool recommends Fair Isaac. The Motley Fool has a disclosure policy. 1 Smart Growth Stock to Buy With Under $100 in August was originally published by The Motley Fool