With the greater need to provide support to students, employees, consumers and others who are Deaf or hard of hearing, more are captioning their content. However, many automatic Artificial Intelligence-based tools miss the mark.
It’s important to know what to look for in captioning tools to ensure they offer the necessary security to protect sensitive information, as well as reach the level of accuracy needed to offer equity to individuals who rely on captions to participate.
HIPAA Compliance and Security
Complying with the Health Insurance Portability and Accountability Act (HIPAA) is vital for various businesses and industries dealing with sensitive information and captioning it. These include telehealth, legal proceedings, medical schools and others. HIPAA violations can lead to civil and criminal penalties.
No one should use platforms like Zoom to discuss information that is HIPAA protected unless they implement the appropriate security. Free or basic business Zoom accounts and free captioning programs like Otter are not HIPAA compliant. Medical students need to understand these distinctions considering that most US medical schools now offer telemedicine courses.
The technology and captioning provided by Verbit were designed to meet robust privacy protections, including offering HIPAA compliant closed captioning on Zoom to help those who need to meet these requirements. Anyone using Zoom captioning or transcription software should take the time to understand why this security is critical for protecting patients, healthcare providers, attorneys and anyone else who is handling sensitive healthcare information.
ASR is Not Enough: Concerns Over Captioning, Accuracy and Accessibility
Privacy is not the only reason to select a closed captioning service carefully. Universities, businesses and others must meet high accuracy standards when it comes to their captioned content. The National Deaf Center (NDC) specifically points out that ASR is not (yet) accurate enough to provide adequate captions. The NDC states that offering flawed automated closed captioning fails to create an equitable experience for the Deaf and hard of hearing communities.
The UK government also requires a human editing layer for all of its online captioning. According to the UK, the World Wide Web Consortium’s Web Content Accessibility Guidelines (WCAG) create an expectation of accuracy that exceeds ASR’s capabilities.
Imagine that ASR is trying to interpret the sentence “I designed that red dress” but misinterprets “designed” with “declined.” The sentence becomes, “I declined that red dress.” While grammatically accurate and intelligible, that single substitution error changes the meaning of the sentence. Substitutions are one of the three error types common to ASR, along with deletions and insertions.
Throughout a long video or lecture, ASR that generates many errors may create captions that are incredibly challenging to follow. Not only could this lead to direct misunderstandings of the material, but it also causes distractions and requires that readers perform an extra step in digesting the information.
The rate of errors increases with accented speech, poor recording quality and uncommon terminology. Studies also indicate that when a Deaf person speaks in classrooms, business meetings or other venues, the technology is less capable of generating accurate captions. It is, therefore, not difficult to see why organizations like the NDC claim that ASR alone cannot create an equitable experience for those who rely on captions.
Accuracy and Legal Consequences
Caption accuracy is not just vital for the user’s experience but also to protect the organization from potential lawsuits. The National Association of the Deaf recently sued Harvard and MIT for discrimination based on their online content. The lawsuits pointed not only to a lack of captions but also to woefully inaccurate captions.
For instance, in one video, captions for the phrase “on our campus…” stated, “hot Campen good.” In another example, the captions for “these were either the government or partners of the government…” read “the square governmental Oct ago.” These errors illustrate how inaccurate captions can be useless to those who need them, while the lawsuit highlights the legal importance of offering accurate captions.
How Verbit Addresses Accuracy Concerns
Verbit’s captioning and transcription services tackle ASR’s accuracy challenges in two ways. First, Verbit uses mature AI that continues to learn and improve to accommodate heavy accents, new words and phrases. The user can also preprogram complex terms so that the captioning software is better at recognizing them. This feature makes Verbit incredibly helpful for professionals, academics and others who often use complex or industry-specific speech in lectures, meetings and events.
Additionally, even when using the best closed captioning software, a human transcriptionist needs to correct the ASR’s errors to ensure clean and readable transcript or captions for the user. Verbit’s solution adds this human layer to the transcription and captioning process.
Verbit’s solution uses technology to support human transcriptionists while offering robust privacy standards and 99%+ accuracy. For more information about Verbit’s captioning, transcription, audio description and subtitling services, contact us.