Up Next

94.8% of students favor video transcripts


When asked to rate the online video experience, 94% of students found that implemented videos are useful and beneficial. In addition 94.8% of students would recommend the addition of video transcripts on university websites.
The main purpose of video transcripts in education is to enable and increase student engagement especially amongst disabled students. Verbit’s proprietary technology has been fine-tuned and altered in order to focus on the specific individual needs of education field. This includes transcriptions and captions which must be as accurate as possible due to the fact that they are critical for learning.

Online learning and studying remotely is becoming increasingly popular as students can now learn and access information at their convenience. The sample was an accurate demonstration of how students now aim to learn online. It was interesting to compare and contrast the preference of multi-modal learning to structured traditional learning behaviors. Hopefully, more educational institutions will embody online learning strategies and make use of time synchronized transcription solutions.

University students are happy with the useful benefits Verbit has provided with online video transcripts and research has identified some of the keys reasons why students use online videos. They are as follows:


Research has shown that students enjoy online transcribed videos because they can move at their own pace which positively impacts the learning process.   In many cases, the ability to work at a faster pace due to transcripts has meant that they are able to complete tasks in a more timely manner using features such as searching for keywords and being able to jump to specific subjects. Just as we all have the ability have everything searchable at our fingertips,  the same ability should and is being allocated to students with educational content.

A respondent stated that, “its now easier to search for particular content in the video. No more manual sliding”.

Creating Study Guides

Another benefit includes having the ability to create study guides. A simple plugin will enable students to print or download a transcription. Some content which might be downloaded include lecture notes, flash cards, course study guides, power point presentations, and text book drafts.

Understanding English as a Second Language

When learning a new or second language, multi sensory learning is always the preferred method. Studies show that when people trying to recall or learn something strictly via memory, they respond better and retain more information when using  multi sensory formats rather than just visual formats alone. Multi sensory formats provide users and learners the opportunity to understand English via text.

Analyzing research raises the question of how online interactive videos will further benefit educational institutions. As we delve deeper and deeper into a world where are lives are influenced by modern technology, online learning will only become more and more relevant and required. This evidently means that universities will need to keep their technology systems up to date in order to maintain the cost effective implementation of user controlled and user friendly interfaces.

When using interactive transcribed video, the basis of the learning experience is having the ability to click where and when needed, rewind, skip, skim, pause, highlight, download, print, and much more. This is one of the most important advantages within the transcription market and precisely why students are utilizing these interactive transcription functionalities made available byVerbit.

Up Next

How to beat Google’s speech recognition technology

We’re pretty sure there’s not a person alive on this Earth who’s never heard of Google.

The internet platform has become the leading search engine in the world, setting the standards and raising the bar for digital technology algorithms. Google has since embarked on speech recognition technology which utilizes the concepts behind closed captioning and video transcripts.

In recent years they have made improvements on their speech recognition platforms. When speaking of AI developments, Google CEO Sundar Pichai said, “We’ve been using voice as an input across many of our products, that’s because computers are getting much better at understanding speech. We have had significant breakthroughs, but the pace even since last year has been pretty amazing to see. Our word error rate continues to improve even in very noisy environments. This is why if you speak to Google on your phone or Google Home, we can pick up your voice accurately.”

At Verbit, having taken Googles improvements into close consideration, we’ve compiled a list of ways to beat their speech recognition technology by doing it ourselves for our own models.

We are helping companies with their speech recognition needs and training their exact audio data with our proprietary transcription technology. While companies would buy generic data models from Fisher or others, we can impact the models with the customer’s own data.

The way that we do it is through a mix of technology and people.

At Verbit, we pride ourselves on having built an adaptive ASR (Automated Speech Recognition) technology to recognize all types of human voices, even with low quality audio and confusing terminology. Our proprietary ASR which is furthermore specifically trained for the domain of the customer, through the use of Artificial intelligence – something we at Verbit use to our advantage.


This is part of the three layer loop process which consists of the following:

  1. Proprietary ASR (Automated Speech Recognition) Technology– the process defined above. This layer is highly accurate creating (87%-95%) transcribed jobs in a matter of minutes.
  2. The transcript is then passed on to the editors and reviewers. Here they aim to ensure that the transcript becomes a error free transcript with more than +99% accuracy.
  3. The final layer is the assessment stage which is done in order to oversee any evident errors using AI. It’s also in this layer where the content is trained for new contexts and different accents.

By using a three layer loop process in Verbit’s voice recognition process, the accuracy and efficiency are always improving, as we are utilizing our own data to improve our acoustic algorithms. The hybrid model also makes for a excellent customer experience in that we are able to manage, monitor, and modify jobs in a timely yet effective manner.

Pricing, accuracy, and turnaround time has become Verbit’s significant benchmark and we use this as a platform to beat competitors such as Google in speech recognition technology.

Back To Top