AI Is Spreading Old Stereotypes to New Languages and Cultures

WIRED, 23-04-2025

Apr 23, 2025 12:31 PM

Margaret Mitchell, an AI ethics researcher at Hugging Face, tells WIRED about a new dataset designed to test AI models for bias in multiple languages.

Margaret Mitchell is a pioneer when it comes to testing generative AI tools for bias. She founded the Ethical AI team at Google, alongside another well-known researcher, Timnit Gebru, before they were later both fired from the company. She now works as the AI ethics leader at Hugging Face, a software startup focused on open source tools.
We spoke about a new dataset she helped create to test how AI models continue perpetuating stereotypes. Unlike most bias-mitigation efforts that prioritize English, this dataset is malleable, with human translations for testing a wider breadth of languages and cultures. You probably already know that AI often presents a flattened view of humans, but you might not realize how these issues can be made even more extreme when the outputs are no longer generated in English.
My conversation with Mitchell has been edited for length and clarity.
Reece Rogers: What is this new dataset, called SHADES, designed to do, and how did it come together?
Margaret Mitchell: It's designed to help with evaluation and analysis, coming about from the BigScience project. About four years ago, there was this massive international effort, where researchers all over the world came together to train the first open large language model. By fully open, I mean the training data is open as well as the model.
Hugging Face played a key role in keeping it moving forward and providing things like compute. Institutions all over the world were paying people as well while they worked on parts of this project. The model we put out was called Bloom, and it really was the dawn of this idea of 'open science.'
We had a bunch of working groups to focus on different aspects, and one of the working groups that I was tangentially involved with was looking at evaluation. It turned out that doing societal impact evaluations well was massively complicated—more complicated than training the model.
We had this idea of an evaluation dataset called SHADES, inspired by Gender Shades, where you could have things that are exactly comparable, except for the change in some characteristic. Gender Shades was looking at gender and skin tone. Our work looks at different kinds of bias types and swapping amongst some identity characteristics, like different genders or nations.
There are a lot of resources in English and evaluations for English. While there are some multilingual resources relevant to bias, they're often based on machine translation as opposed to actual translations from people who speak the language, who are embedded in the culture, and who can understand the kind of biases at play. They can put together the most relevant translations for what we're trying to do.
Reece Rogers: So much of the work around mitigating AI bias focuses just on English and stereotypes found in a few select cultures. Why is broadening this perspective to more languages and cultures important?
Margaret Mitchell: These models are being deployed across languages and cultures, so mitigating English biases, even translated English biases, doesn't correspond to mitigating the biases that are relevant in the different cultures where these models are being deployed. This means that you risk deploying a model that propagates really problematic stereotypes within a given region, because the models are trained on these different languages.
So, there's the training data. Then, there's the fine-tuning and evaluation. The training data might contain all kinds of really problematic stereotypes across countries, but then the bias mitigation techniques may only look at English. In particular, it tends to be North American– and US-centric. While you might reduce bias in some way for English users in the US, you've not done it throughout the world. You still risk amplifying really harmful views globally because you've only focused on English.
Reece Rogers: Is generative AI introducing new stereotypes to different languages and cultures?
Margaret Mitchell: That is part of what we're finding. The idea of blondes being stupid is not something that's found all over the world, but is found in a lot of the languages that we looked at.
When you have all of the data in one shared latent space, then semantic concepts can get transferred across languages. You're risking propagating harmful stereotypes that other people hadn't even thought of.
Reece Rogers: Is it true that AI models will sometimes justify stereotypes in their outputs by just making shit up?
Margaret Mitchell: That was something that came out in our discussions of what we were finding. We were all sort of weirded out that some of the stereotypes were being justified by references to scientific literature that didn't exist.
The outputs would say, for example, that science has shown genetic differences where it hasn't, which is a basis of scientific racism. The AI outputs were putting forward these pseudo-scientific views, and then also using language that suggested academic writing or having academic support. They spoke about these things as if they were facts, when they're not factual at all.
Reece Rogers: What were some of the biggest challenges when working on the SHADES dataset?
Margaret Mitchell: One of the biggest challenges was around the linguistic differences. A really common approach for bias evaluation is to use English and make a sentence with a slot like: 'People from [nation] are untrustworthy.' Then, you flip in different nations.
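The slot-and-swap approach described here can be sketched in a few lines of Python. This is only an illustration, not code from SHADES: the `contrastive_set` helper and the list of nations are hypothetical, and the template wording simply mirrors the example quoted above.

```python
# Hypothetical slot template; the wording mirrors the example in the interview.
TEMPLATE = "People from {nation} are untrustworthy."

# Illustrative fill values only, not taken from the SHADES dataset.
NATIONS = ["France", "Japan", "Brazil"]


def contrastive_set(template: str, slot: str, values: list[str]) -> list[str]:
    """Return one sentence per value, identical except for the filled slot."""
    return [template.format(**{slot: value}) for value in values]


sentences = contrastive_set(TEMPLATE, "nation", NATIONS)
for sentence in sentences:
    print(sentence)
```

Because the sentences differ only in the filled slot, any difference in how a model treats them can be attributed to the swapped identity term. In English this works with plain string substitution, since nothing else in the sentence has to change.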
When you start putting in gender, now the rest of the sentence starts having to agree grammatically on gender. That's really been a limitation for bias evaluation, because if you want to do these contrastive swaps in other languages—which is super useful for measuring bias—you have to have the rest of the sentence changed. You need different translations where the whole sentence changes.
How do you make templates where the whole sentence needs to agree in gender, in number, in plurality, and all these different kinds of things with the target of the stereotype? We had to come up with our own linguistic annotation in order to account for this. Luckily, there were a few people involved who were linguistic nerds.
So, now you can do these contrastive statements across all of these languages, even the ones with the really hard agreement rules, because we've developed this novel, template-based approach for bias evaluation that's syntactically sensitive.
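To see why agreement complicates templates, consider a rough sketch in Python using Spanish. It is not the SHADES annotation scheme, which the interview does not detail: the `Target` class, the lookup tables, and the `realize` function are all assumptions made for illustration, and the predicate ("alto", meaning tall) is deliberately neutral.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Target:
    """Hypothetical record for the word that fills the template slot."""
    noun: str
    gender: str  # "m" or "f"
    number: str  # "sg" or "pl"


# Spanish forms keyed by (gender, number). The article and adjective must
# agree with the target noun; the verb agrees in number only.
ARTICLES = {("m", "sg"): "El", ("f", "sg"): "La",
            ("m", "pl"): "Los", ("f", "pl"): "Las"}
ADJ_TALL = {("m", "sg"): "alto", ("f", "sg"): "alta",
            ("m", "pl"): "altos", ("f", "pl"): "altas"}
COPULA = {"sg": "es", "pl": "son"}


def realize(target: Target, adjective_forms: dict) -> str:
    """Build a sentence whose article, verb and adjective agree with the target."""
    key = (target.gender, target.number)
    return (f"{ARTICLES[key]} {target.noun} "
            f"{COPULA[target.number]} {adjective_forms[key]}.")


targets = [
    Target("niño", "m", "sg"),
    Target("niña", "f", "sg"),
    Target("niños", "m", "pl"),
    Target("niñas", "f", "pl"),
]
for target in targets:
    print(realize(target, ADJ_TALL))
```

Swapping the target changes three other words in a four-word sentence, which is why a single fixed string with one slot is not enough once gender and number come into play.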
Reece Rogers: Generative AI has been known to amplify stereotypes for a while now. With so much progress being made in other aspects of AI research, why are these kinds of extreme biases still prevalent? It's an issue that seems under-addressed.
Margaret Mitchell: That's a pretty big question. There are a few different kinds of answers. One is cultural. I think within a lot of tech companies it's believed that it's not really that big of a problem. Or, if it is, it's a pretty simple fix. What will be prioritized, if anything is prioritized, are these simple approaches that can go wrong.
We'll get superficial fixes for very basic things. If you say girls like pink, the model recognizes that as a stereotype, because it's just the kind of thing that, if you're thinking of prototypical stereotypes, pops out at you, right? These very basic cases will be handled. It's a very simple, superficial approach where these more deeply embedded beliefs don't get addressed.
It ends up being both a cultural issue and a technical issue of finding how to get at deeply ingrained biases that aren't expressing themselves in very clear language.
