Microsoft’s speech recognition system reaches human-level accuracy

Microsoft announced today that its speech recognition system has reached human-level accuracy, a goal the company has pursued for 25 years. Last year, Microsoft’s researchers recorded a 5.9 percent word error rate for the system; with the latest improvements, it now stands at 5.1 percent.

Microsoft said that it lowered the error rate by adding a CNN-BLSTM (convolutional neural network combined with bidirectional long short-term memory) model to its system.

Researchers at Microsoft have also been working on neural network-based acoustic and language models, which contributed to the reduction in the error rate. Microsoft’s speech recognition technology is used in many of its services, such as Cortana, Presentation Translator, and Microsoft Cognitive Services.

The company said in a blog post:

“After our transcription system reached the 5.9 percent word error rate that we had measured for humans, other researchers conducted their own study, employing a more involved multi-transcriber process, which yielded a 5.1 human parity word error rate. This was consistent with prior research that showed that humans achieve higher levels of agreement on the precise words spoken as they expend more care and effort. Today, I’m excited to announce that our research team reached that 5.1 percent error rate with our speech recognition system, a new industry milestone, substantially surpassing the accuracy we achieved last year.”
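
The article does not spell out how a word error rate is calculated. As a rough illustration, the commonly used definition counts the word substitutions, deletions, and insertions needed to turn the system's transcript into the reference transcript, then divides by the number of words in the reference. The sketch below assumes that definition and simple whitespace tokenization; the function name `word_error_rate` is hypothetical, and this is not Microsoft's evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substituted word and one dropped word against a six-word reference.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, a 5.1 percent word error rate corresponds to roughly 51 word-level errors per 1,000 reference words.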

It is worth noting that last year the company formed a 5,000-person Artificial Intelligence and Research group to pursue research in artificial intelligence (AI) and to compete with other tech companies that are also working on AI and cloud technology. With this recent achievement, that decision appears to have paid off.

Published by
Ali Leghari
