Close Menu
clearpathinsight.org
  • AI Studies
  • AI in Biz
  • AI in Tech
  • AI in Health
  • Supply AI
    • Smart Chain
    • Track AI
    • Chain Risk
  • More
    • AI Logistics
    • AI Updates
    • AI Startups

Brink Bites: Using AI to Detect Alzheimer’s Disease; NIH Supports COPD Research in BU | The edge

October 17, 2025

NSF Announces Funding to Establish National AI Research Resources Operations Center | NSF

October 17, 2025

Cutting-edge imaging and AI research looking for tiny defects in chips

October 17, 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
clearpathinsight.org
Subscribe
  • AI Studies
  • AI in Biz
  • AI in Tech
  • AI in Health
  • Supply AI
    • Smart Chain
    • Track AI
    • Chain Risk
  • More
    • AI Logistics
    • AI Updates
    • AI Startups
clearpathinsight.org
Home»AI Applications & Case Studies»How this grassroots effort could make AI voices more diverse
AI Applications & Case Studies

How this grassroots effort could make AI voices more diverse

November 16, 2024003 Mins Read
Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email Telegram WhatsApp
Follow Us
Google News Flipboard
Voices people2.jpg
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

Ryakitimbo collected Kiswahili voice data in Tanzania, Kenya and the Democratic Republic of Congo. She tells me that she wanted to collect the voices of a socio-economically diverse set of Kiswahili speakers and that she reached out to women, young and old, living in rural areas, who were not always literate or n They didn’t even have access to the devices.

This type of data collection is challenging. The importance of AI voice data collection may seem abstract to many people, especially if they are not familiar with the technologies. Ryakitimbo and the volunteers approached women in settings where they initially felt safe, such as presentations on menstrual hygiene, and explained to them how technology could, for example, help disseminate information about menstruation. For women who could not read, the team read sentences which they repeated for the recording.

The Common Voice project is based on the belief that languages ​​are a very important part of identity. “We believe it’s not just about language, but also about transmitting culture and heritage and cherishing people’s particular cultural background,” says Lewis-Jong. “There are all kinds of idioms and cultural catchphrases that just don’t translate,” they add.

Common Voice is the only audio dataset in which English does not dominate, says Willie Agnew, a researcher at Carnegie Mellon University who has studied audio datasets. “I’m very impressed with the quality of their work and how they created this data set that’s actually quite diverse,” Agnew says. “It feels like they’re way ahead of almost every other project we’ve looked at.”

I spent some time checking recordings of other Finnish speakers on the Common Voice platform. As their voices echoed through my office, I felt surprisingly touched. We were all united around the same cause: making AI data more inclusive and ensuring our culture and language are properly represented in the next generation of AI tools.

But I had big questions about what would happen to my voice if I donated it. Once it was in the dataset, I would have no control over how it might be used afterwards. The tech sector isn’t really known for give people proper credit, and the data is accessible to everyone.

“While we want this to benefit local communities, there is a possibility that big tech will also use the same data and create something that then becomes a commercial product,” says Ryakitimbo. Although Mozilla doesn’t say who downloaded Common Voice, Lewis-Jong tells me that Meta and Nvidia have reported using it.

Open access to this rare and hard-won linguistic data is not something all minority groups want, says Harry H. Jiang, a researcher at Carnegie Mellon University who was part of the research team. audit. For example, Indigenous groups raised concerns.

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Posts

NCITE Insights No. 36 – AI CASE STUDY REPORT | National Center for Innovation, Technology and National Education (NCIT)

September 19, 2025

The use of artificial intelligence (AI) to generate case studies for the classroom – Focus of teachers

September 19, 2025

Deloiteaie uses box by type and industry organized collection of generative IA in cases of use of finances designed to help trigger ideas, reveal valuable driving deployments and define organizations on a road to …. June 18, 2025

September 18, 2025
Add A Comment
Leave A Reply Cancel Reply

Categories
  • AI Applications & Case Studies (29)
  • AI in Business (75)
  • AI in Healthcare (64)
  • AI in Technology (78)
  • AI Logistics (24)
  • AI Research Updates (42)
  • AI Startups & Investments (64)
  • Chain Risk (37)
  • Smart Chain (32)
  • Supply AI (21)
  • Track AI (33)

Brink Bites: Using AI to Detect Alzheimer’s Disease; NIH Supports COPD Research in BU | The edge

October 17, 2025

NSF Announces Funding to Establish National AI Research Resources Operations Center | NSF

October 17, 2025

Cutting-edge imaging and AI research looking for tiny defects in chips

October 17, 2025

AI is a strategic tool to improve scientific research

October 17, 2025

Subscribe to Updates

Get the latest news from clearpathinsight.

Topics
  • AI Applications & Case Studies (29)
  • AI in Business (75)
  • AI in Healthcare (64)
  • AI in Technology (78)
  • AI Logistics (24)
  • AI Research Updates (42)
  • AI Startups & Investments (64)
  • Chain Risk (37)
  • Smart Chain (32)
  • Supply AI (21)
  • Track AI (33)
Join us

Subscribe to Updates

Get the latest news from clearpathinsight.

We are social
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Reddit
  • Telegram
  • WhatsApp
Facebook X (Twitter) Instagram Pinterest
© 2025 Designed by clearpathinsight

Type above and press Enter to search. Press Esc to cancel.