Announcing Accuracy Evaluation For Cloud Speech-To-Text

May 2, 2023

2 min read

We are thrilled to introduce Accuracy Evaluation, the newest feature in our Cloud Speech UI, to allow for easy and seamless benchmarking of our Speech-to-Text (STT) API models and configurations. The STT API covers a wide variety of use cases, from dictation and short commands, to captioning and subtitles. Getting the most of STT, however, can be a complicated process. To achieve the highest accuracy on any AI use case requires careful testing and tuning to find just the right configuration.

We have been diligently listening to customer feedback, and looking for a quick and effective way to benchmark our current and future STT API offerings. Previously, our customers and enterprise users had to do this work manually. This included invoking the API to generate the transcripts and save the result, then using a command-line tool, relying on a third-party library, or writing code to compare the STT system results with a ground-truth file. For every model and configuration, this process had to be redone, which was cumbersome,time-consuming and error prone.

A 3-step process to measure accuracy

Today’s announcement significantly simplifies the process. Now, the user-friendly interface in the Accuracy Evaluation feature in our Cloud Speech UI makes it easy for anyone on your team to evaluate the accuracy of our STT API against your own datasets. To begin, customers upload audio files, specify the desired STT API configurations and ground-truth, and the benchmarking is done automatically for you. To ensure maximum privacy and security, audio files uploaded are only processed inside your own Google Cloud Tenant Project.

To measure and compare the accuracy of our STT API, we use the industry standard of Word Error Rate (WER), a simple, easy-to-understand metric that can be compared across different models and datasets. It is defined as the ratio of the total number of errors (Insertions, Deletions, and Substitutions) to the total number of words in the reference transcript, and it ranges from 0%, when the output of the STT system matches exactly the ground-truth, to 100%, when there is no match at all. Our tool calculates WER for the STT output and the ground-truth, while also providing a detailed breakdown on the Insertion, Substitution and Deletion errors, giving scientists and application developers exactly the information they need to be successful in their workflow.

https://storage.googleapis.com/gweb-cloudblog-publish/images/Accuracy_Evaluation.max-1900x1900.jpg

To access Accuracy Evaluation, log in to our Speech-to-text User Interface and navigate to the “Transcriptions” tab. After you have successfully transcribed your audio file, use the Transcription Accuracy section. Click the Upload Ground Truth button at the top of the section to begin calculating accuracy.

Learn more about Accuracy

Detailed instructions on how to use the new feature can be found here, and if you are curious to learn more about how accuracy is measured in production-facing Speech Transcription systems, you can find our documentation here.

We are excited to see the insights and improvements you can achieve with Accuracy Evaluation on Cloud Speech UI and we look forward to supporting you with the best in-class Speech-to-Text systems.

By Haris Ioannou Product Manager, Cloud Speech
Originally published at Google Cloud

Source: Cyberpo g o

For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

liwaiwai

Speak The Language Of The Future! Here Are The Top 10 Programming Languages For AI

How Sovereign Funds Could Empower The Future Of Assistive Technology And Disability AI

August 22, 2023

Artificial intelligence (AI) algorithms are rapidly fuelling assistive technologies for individuals with…

5 min read

DarwinAI Makes AI Applications More Efficient And Less Of A ‘Black Box’ — With Its Own AI

March 27, 2020

As a student pursuing a doctorate in systems design engineering at the University of Waterloo, Alexander Wong…

3 min read

PT Meratus Line Enters Alliance With Google Cloud And PT Metrodata Electronics Tbk To Build Indonesia’s First Maritime Logistics Super App

February 20, 2023

Indonesia’s leading maritime and logistics operator will also harness Google Cloud’s AI capabilities to empower…

5 min read

Can You Write An Entire Blog Post With ChatGPT?

February 13, 2023

Lets get that question answered and ask Mr ChatGPT himself. Here is an article produced by ChatGPT…

4 min read

Announcing Accuracy Evaluation For Cloud Speech-To-Text

A 3-step process to measure accuracy

Learn more about Accuracy

For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

Speak The Language Of The Future! Here Are The Top 10 Programming Languages For AI

Exploring The Technology And Training Secrets Behind ChatGPT

People Are Using AI Chatbots to Guide Their Psychedelic Trips

Exploring data and its influence on political behavior

New postdoctoral fellowship program to accelerate innovation in health care

Confronting the AI/energy conundrum

Building secure, scalable AI in the cloud with Microsoft Azure

Robotic probe quickly measures key properties of new materials

Confronting the AI/energy conundrum

Despite Protests, Elon Musk Secures Air Permit for xAI

From Sensual Butt Songs to Santa’s Alleged Coke Habit: AI Slop Music Is Getting Harder to Avoid

Here’s What Mark Zuckerberg Is Offering Top AI Talent

A Pro-Russia Disinformation Campaign Is Using Free AI Tools to Fuel a ‘Content Explosion’

Livestream Replay: Beginner Advice for Claude, a ChatGPT Alternative

Announcing Accuracy Evaluation For Cloud Speech-To-Text

A 3-step process to measure accuracy

Learn more about Accuracy

For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

Share this article

Speak The Language Of The Future! Here Are The Top 10 Programming Languages For AI

Exploring The Technology And Training Secrets Behind ChatGPT

Read next