Project that makes automatic speech recognition tech more accessible is now sharing data with organisations
The Speech Accessibility Project, which aims to make automatic speech recognition technology more accessible to people with speech differences and disabilities, is now sharing some of its voice recordings and related data with universities, not-for-profit organisations, and companies.
First launched in October 2022 by the University of Illinois Urbana-Champaign in the US, the project is funded and backed by technology giants Amazon, Apple, Google, Meta, and Microsoft. The university is using the project to train voice recognition technologies to understand people with diverse speech patterns and disabilities.
Now, the Speech Accessibility Project team is accepting signed data use agreements and one-page proposals for recordings of 211 people with Parkinson’s.
The download also includes text of the original speech prompts and a transcript of the participants’ responses. A subset includes annotations describing the speech characteristics and how they affect participants. Additional recordings will be released each month.
“The goal of the Speech Accessibility Project is to make speech technology accessible to everyone,” said Mark Hasegawa-Johnson, the project’s leader and a professor of electrical and computer engineering at the University of Illinois Urbana-Champaign. “We’ve already shown that, by training an ASR using one part of this dataset, it’s possible to reduce error rates for people with Parkinson’s disease by almost a factor of two.
“We are hoping that other universities and companies will be inspired and challenged by that result to come up with hundreds of good ideas for using this data to make speech technology more accessible.”
The project’s data use agreement requires that any organisation using the data will not distribute it to a third party and prevent any unauthorised use. The agreement encourages those using the data to make their results publicly available and acknowledge the Speech Accessibility Project in any publications.
Data users must also agree not to use the data to attempt to identify anyone who participated in the project by recording their voice and promptly delete data from any participant who withdraws consent from the project.
Mark added: “This is a big milestone for us, especially for those of us who have close friends and family members living with Parkinson’s. We hope and intend that these data will make a measurable difference in people’s ability to access technology.”