Ai-Media’s Smart ASRTMuses the power of artificial intelligence and our human-curated custom dictionaries layered onto existing ASR engines to deliver results significantly better than standard ASR products in the market.
Co-Founder, Director and CEO
“We combined our automated technologies with the knowledge and skill of our expert captioning team to create the best ASR solution out there.”
What is Smart ASR™
Layering our human curation onto the ASR, Smart ASR hits a sweet spot in terms of the benefit to the customer and the price point.
Performed by a team with over a decade of expertise in training language software to produce accurate live captions, curating and crafting preparation materials, delivering live broadcast captions, and implementing broadcast technology and infrastructure on a global scale.
For each session, our expert captioning team conducts in-depth research using our specialised in-house database and customer-provided documentation. They use this to compile names, terms, phrases, spellings and pronunciations tailored to the needs of the session, and feed them into our comprehensive custom dictionaries and custom captioning filter.
Models have been refined using our more than 10 years of human expertise and data. Our custom dictionaries – overseen by our highly skilled team – teach the ASR engine key names and phrases tailored to every captioning session and its particular subject matter, as well as phonetic pronunciations. They refine accuracy for particularly challenging terms and apply customer-specific formatting, standards and any censorship requirements to create the best ASR captions possible.
Automation brings everything together. With the influence of our expert captioning team and the power of our custom dictionaries, it delivers the optimal ASR caption accuracy thanks to our understanding of the subject matter in any given context.
Not all Automatic Speech Recognition (ASR) is created equal.
“Our solution utilizes human-curated custom dictionaries and custom caption filtering, adding a layer of refinement to the raw ASR output resulting in greater accuracy”
– Ai-Media, Product Team
How does Smart ASR™ compare on Accuracy?
The accuracy of live captions varies greatly. There are several options in the market that can deliver according to the needs of the consumer.
Out-of-the-box ASR – which includes the free captions available on Zoom, YouTube and Google – has no human input and, as a result, the lowest accuracy. It is best-suited to casual meetings where accuracy is not an important consideration.
Current industry leader in ASR: Lexi
EEG’s Lexi product currently tops the industry in ASR live captioning solutions. It is better quality than what out-of-the-box captions can offer, and is suited to live streaming and live broadcast situations where some level of accuracy is needed, but errors are acceptable.
Ai-Media’s Smart ASR™:
By laying Ai-Media’s technology on existing ASR products, Smart ASR delivers a significant improvement over the performance of standard ASR products in the market.
It achieves accuracy outcomes approximately halfway between generic out-of-the-box ASR and Ai-Media’s premium service – representing a ground-breaking development in the industry.
Ai-Media Premium Live Captions
For those who need the highest quality captioning available, Ai-Media’s premium, human captioning service remains the top choice. This service features high-quality live captions generated by Ai-Media’s skilled and experienced human captioners.
It is the best choice for content with multiple speakers and accents, and environments with poorer audio quality.
Let us help you find the right solution for your specific needs.
Each service offered is unique and has specific applications. Our team will gladly explain each method and assist you in choosing the most appropriate service for your specific need.
Smart ASR is designed to meet a gap in the market where ASR ‘out-of-the-box’ is not high-quality enough and our premium human-captioned service is not affordable enough for the job.
Smart ASR is particularly well suited to live broadcast situations that use studio-quality audio and a predictable dictionary of defined terms. It is ideal for live news, weather segments and one-on-one style broadcasts. It is the perfect answer for broadcasters who need a scalable live captioning solution at a lower price than our premium service.
Frequently Asked Questions
What is Smart ASR?
Ai-Media’s Smart ASR is a groundbreaking live captioning solution that is automated using machine learning and human-curated by our expert team. Our Smart ASR represents the next generation automated live captioning, thanks to the skill and technical experience of our captioning curation team, our custom dictionaries and our artificial intelligence automation.
What is Smart ASR best used for?
Ai-Media’s Smart ASR is a fantastic option for those wanting live captions at a lower cost than premium human captioning and a higher accuracy than ‘out-of-the-box’ automated captions. It is perfect for live settings with a single speaker and a clear audio feed, with minimal background noise, music or overlapping dialogue. This includes live broadcast: for example, live news, one-on-one interviews and weather segments.
How accurate are Smart ASR captions?
Ai-Media’s Smart ASR uses human-curated custom dictionaries and artificial intelligence automation to add a layer of human refinement to raw ASR output, meaning better accuracy for you and your users. The accuracy of our Smart ASR captions is significantly higher than out-of-the-box ASR. Accuracy will also vary from session to session, depending on the quality of the audio feed and the accent of the speakers.
How is Ai-Media’s ASR solution different from its competitors?
The Ai-Media ASR difference comes down to our expert global team and our in-house technical development. In our team, we have decades of experience in understanding which terms and phrases are typically difficult to caption, and we have built our own end-to-end caption delivery system to make the process as smooth as possible. As a result, our system is optimized for maximum accuracy and minimal delays.
What is the price of Smart ASR?
The price of our Smart ASR service varies depending on the specifics of implementing the service into your infrastructure and workflow and your volume of content. As a guide, Smart ASR captions are usually around half the cost of human-generated live captions.
How does the speed of Smart ASR captions compare to human-generated live captions?
Smart ASR captions have a similar or shorter time delay than human-generated live captions. The delay between the audio and the Smart ASR captions is usually around two to four seconds, whereas human-generated captions are usually delayed by around four to seven seconds.
What are custom dictionaries and how do they work?
Custom dictionaries are databases of terms and phrases that our captioning team uses to teach an ASR engine, so it produces the words correctly when it ‘hears’ them. Our team first researches and compiles key names and phrases on the session’s subject matter. Next, they use their in-depth knowledge of speech recognition software to program phonetic pronunciations into our Smart ASR engine. This process makes the live captions more accurate when consumers receive them to their screens.
Does Smart ASR meet regulatory requirements for the provision of live captioning?
Regulatory guidelines for live captioning vary greatly between countries and regions. While Smart ASR meets regulatory requirements for quality and time delay in many regions, some countries have specific requirements that Smart ASR does not yet meet, such as strict speaker change indication and on-screen positioning. If you are unsure of your country’s requirements, please get in touch for more information here.
Do we store your data?
Will Smart ASR impact human captioning?
Human input will always be essential for the accuracy and customization of our captions. Smart ASR would not be possible without the time and effort of our expert team, who operate and refine our custom dictionaries and artificial intelligence automation system.
We would also not be able to satisfy the majority of Ai-Media’s business, which is made up of human captioning!