
How to Optimize Audio Quality for Transcription


Audio quality is the single most important factor for accurate transcription. Poor quality could result in errors or inaudible sections, decreasing the precision of the record. This also affects turnaround time and, in turn, cost.

Optimize the audio quality of your recordings and save time and money by following these 12 tips:

1. Pay attention to room acoustics

Keep in mind that large empty rooms tend to produce echoes, which decrease sound quality.  

2. Place microphones strategically throughout the room 

Making sure there are microphones in all key locations helps to ensure that all relevant spoken words are captured on the recording.

3. Place your microphone close to the speaker

The best spot is either directly below or to the side of the speaker’s mouth.

4. Test your microphone or recording device

Conducting a trial run helps prevent any issues, while also allowing for better determination of the sound quality and the ideal spot for microphone placement.

5. Limit background noise & overlapping conversation

If multiple speakers are being recorded, make sure only one person speaks at a time. Limit unnecessary noise as much as possible to ensure clarity.

6. Have speakers introduce themselves

Having participants introduce themselves at the beginning of the recording ensures that all speakers can be properly identified in the transcript.

7. Repeat questions or important statements

This helps to ensure that nothing critical is missed on the recording and, consequently, in the transcript.

8. Create a consistent environment

For repeated recordings in the same location, try to recreate the same conditions so that the audio quality remains consistent in all sessions.

9. Record in the M4A file format

This format is ideal as it produces a high-quality sound file that is also small. The MP3 and WAV formats are also good choices.

10. Monitor input volume as you record

Make sure you stay in the ideal “green zone”. Deviating could mean that the volume is too high or too low, both of which negatively affect audio quality.

11. Use an audio limiter

A limiter caps the signal at a set decibel ceiling so that loud peaks do not distort the recording. Paired with adequate input gain, it keeps the audio from being too loud or too soft.

12. Enhance audio with a sound editor

There are many options available, including reducing background noise, canceling echo, adjusting pitch, providing a volume boost, and more.
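For readers who want to see what tips 10 and 11 mean in numbers, here is a minimal Python sketch. It assumes audio samples are floats between -1.0 and 1.0, and the function names (`peak_dbfs`, `level_zone`, `hard_limit`) and the threshold values are illustrative assumptions, not settings from any particular recorder. The hard clamp is a simplification: real limiters smooth the gain reduction over time instead of clipping each sample.

```python
import math

def peak_dbfs(samples):
    """Peak level in dBFS for float samples in the range [-1.0, 1.0]."""
    peak = max((abs(s) for s in samples), default=0.0)
    return -math.inf if peak == 0.0 else 20.0 * math.log10(peak)

def level_zone(dbfs, low=-24.0, high=-3.0):
    """Classify a peak level. The low/high thresholds are assumed values."""
    if dbfs > high:
        return "too loud"
    if dbfs < low:
        return "too quiet"
    return "green"

def hard_limit(samples, ceiling_db=-6.0):
    """Clamp every sample to a ceiling given in dBFS (simplified limiter)."""
    ceiling = 10.0 ** (ceiling_db / 20.0)
    return [max(-ceiling, min(ceiling, s)) for s in samples]

samples = [0.02, -0.4, 0.95, -0.1]
print(level_zone(peak_dbfs(samples)))              # too loud
print(level_zone(peak_dbfs(hard_limit(samples))))  # green
```

In practice the recording device's level meter and built-in limiter do this work; the sketch only shows why a peak near full scale reads as "too loud" and how a ceiling brings it back into range.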

Although a top-quality transcription service will handle files with difficult audio and still deliver high accuracy and quick turnaround, following these best practices helps make the process smooth for customers and solution providers alike.

Up Next

Transcription Just Got a Lot Smarter: Introducing Verbit’s New Brand

Making the decision to rebrand your company is not an easy one. Doing it right requires significant thought, time, and energy. It also comes with a fair amount of pressure to deliver a new look that exceeds all expectations and retains the essence of who you are.

So why did we do it?

Verbit began just two years ago with a mission to revolutionize the transcription and captioning space. This traditionally manual industry presented an incredible opportunity to introduce automation and greater speed, with artificial intelligence technology.

While that mission remains the same, a lot has changed since November 2016. We’ve grown from a trio of co-founders to a team of sixty people and counting. We’ve moved to a brand new office in the heart of Tel Aviv, with two additional offices in Kiev and in New York City. We’ve grown our customer base to over one hundred happy clients, and we’ve generated millions of dollars in revenue.

On the product side, we’ve developed state-of-the-art speech-to-text technology that generates the most detailed and accurate files, thanks to smart, self-learning algorithms that continuously improve the precision of our transcripts and captions.

From day one, we set out to be the smartest transcription and captioning solution on the market. That’s our secret sauce. And that’s how we came up with our new tagline that encompasses what we’re all about: Transcription just got a lot smarter.

Read on to get to know us a bit better!

Who we are

We harness the power of artificial and human intelligence to provide the smartest transcription and captioning solution. Our customized technology is built on adaptive algorithms and generates the most detailed speech-to-text files to provide over 99% accuracy, delivered at record-breaking speed. Our smart AI technology supports on-demand CART services for real-time results.

What we do

We help make organizations smarter, using our innovative AI transcription and captioning technology. Why should an organization compromise on best-in-class speed or accuracy? We give the best of both, for the lowest cost.

So what’s with the little “v”?

Context isn’t just about words; it’s about what’s in between them too. Adding complementary information enriches the value and, when it comes to transcription and captioning, boosts the accuracy to near-perfect levels.

That’s the Verbit advantage in a nutshell. Our automated speech recognition technology is based on three models: linguistic, acoustic, and contextual events. The last of these is the real difference-maker, incorporating current events, the latest news, and updates into the adaptive cycle to guarantee the highest transcription and captioning accuracy.

In short, the “v” represents the added value that Verbit’s solution provides, on top of just simple transcription and captioning. We go further by adding granular, detailed data to provide the best possible quality at the fastest speed, for the lowest cost.

Our Promise

At Verbit, we don’t stop at simply converting media files to text. We strive to help organizations maximize the potential of their audio and video files by making the information within searchable, accessible and actionable.

What we deliver

Dramatically lower costs: Automating the process ensures up to 50% reduction in operating costs

Enhanced customization: Adaptive technology is 100% personalized, according to your specifications

Record-breaking speed: Artificial and human intelligence deliver results 10x faster than the competition

Unparalleled accuracy: Adaptive, self-learning algorithms guarantee 99%+ accuracy

What’s next

The next generation of transcription and captioning is here, and it’s powered by smart AI. Verbit is on a mission to lead the way in speech-to-text technology and shape the future of the industry with disruptive technology. We’re redefining the way organizations utilize their audio and video assets to provide our customers with the tools they need to be efficient, profitable, and, above all, smart.
