WGBH Works with Nuance Communications, Inc. on Effort to Improve the Quality of Live News Captioning
Nuance Communications, Inc. today announced that it is working in collaboration with WGBH’s National Center for Accessible Media (NCAM) on a U.S. Department of Education-funded Project to develop a prototype system to automatically assess accuracy of real-time captions for live news programming. The system will identify and weigh the impact of different kinds of errors in closed captions.
The variations in quality and accuracy of real-time captioning have long been a concern to people who are deaf or hard of hearing. Captioned news in particular can be a lifeline to viewers who cannot hear well enough to follow program audio, a lifeline only as strong as the captioning is accurate. Live, or real-time captions are most often produced by specially trained court reporters, called stenocaptioners, who use a phonetic keyboard attached to a computerized dictionary to translate spoken words into text in a matter of a few seconds. National consumer advocacy organizations have petitioned the FCC to consider adding a standard of caption quality to existing mandates for quantity of captioning on television.
An early activity of NCAM’s “Caption Accuracy Metrics” Project was the design and launch of a web-based national consumer survey of television news caption viewers. Respondents were asked to read sample captions and identify error types that make captions hard to follow. Basic real-time captioning error types are word substitutions, deletions (dropped words or omissions) and insertions (addition of words not spoken).
WGBH is working with Nuance Communications, a leader in speech solutions, to develop customized language processing, data analysis, and benchmarking tools for the Project. Nuance is leveraging its Dragon Naturally Speaking speech recognition technology to develop these tools in order to quantify and further analyze the captioning data that is being gathered. Nuance’s work is integral to the Project’s development of a final prototype tool that will enable tracking of audio and captioning, comparing the spoken word with the caption output and rating the accuracy levels based on error type and severity.
In addition to software developed at WGBH, The Project is using Nuance’s Dragon speech recognition as a benchmarking and data analysis tool. The Project is not using speech recognition as a caption-creation source. Test data reports will not associate samples with specific television networks, programs or caption agencies.
Project staff are working with a team of advisors, including the National Institute of Standards and Technology (NIST), the federal technology agency that works with industry to develop and apply technology, measurements, and standards, Gallaudet University and the National Technical Institute for the Deaf.
NCAM staff recently convened a technical review panel of major stakeholders in caption quality at the CBS Broadcast Center in New York. Representatives from broadcast and cable networks, caption agencies, deaf education experts, and the National Court Reporters Association discussed the value of the project’s work to their organizations. In addition, the FCC recently announced that it is refreshing the record of comments on caption quality, solicited in two notices of proposed rule making over the past five years, and will determine with consumer and industry feedback if standards of quality should be set and, if so, in what manner.
“The phrase ‘a rising tide lifts all boats’ works well here,” said NCAM project director Marcia Brooks. “Being able to quantify caption errors has been the missing piece for a range of stakeholders. We expect the Project will help improve captioning quality through better and easier monitoring by caption agencies and networks.”
Funding for NCAM’s Caption Accuracy Metrics project is provided by a grant from the National Institute on Disability and Rehabilitation Research, U.S. Department of Education under grant #H133G080093-10.
About Nuance Communications, Inc.
Nuance (NASDAQ: NUAN) is a leading provider of speech, imaging and customer interaction solutions for businesses and consumers around the world. Its technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses experience Nuance’s proven applications and professional services. For more information, please visit: www.nuance.com.
About NCAM and WGBH
The Carl and Ruth Shapiro Family National Center for Accessible Media at WGBH is a research, development and advocacy entity that works to make existing and emerging technologies accessible to all audiences. NCAM (ncam.wgbh.org) is part of the Media Access Group at WGBH, which also includes The Caption Center (est. 1972), and Descriptive Video Service(r) (est. 1990). For more information, visit The Media Access Group at WGBH (access.wgbh.org).
Trademark reference: Nuance, the Nuance logo, and Dragon Naturally Speaking are registered trademarks or trademarks of Nuance Communications, Inc. or its affiliates in the United States and/or other countries. All other company names or product names referenced herein may be the property of their respective owners.