Speech recognition is an important detail in the artificial intelligence engine. This technology controls the voice assistants in our phones, cars, and smart speakers.

But despite its ubiquity, development is still ongoing. Facebook today announced a major breakthrough. The company has developed a speech recognition system that learns without any human input.

Breakthrough innovation

Modern speech recognition systems are trained on audio recordings of conversations and their text transcripts. These transcripts are handwritten by humans. This is a long and boring work since training artificial intelligence requires a huge amount of educational material.

Facebook’s new system, Wav2vec-U, avoids this. Artificial intelligence learns to recognize speech without any decryption. It is enough to “feed” her an audio recording of the speech and a text written in the same language. Audio and text may not be related to each other in any way. Further, the generative-adversarial network repeatedly “runs” speech samples until it finds a correspondence between sound combinations and words.

This is a truly breakthrough technology that allows you to train AI to recognize even very rare languages. As part of the tests, Facebook engineers taught the system to understand Swahili, Kyrgyz, and Crimean Tatar languages. It took about 10 hours of recorded speech and 3000 lines of text to learn each language.

Why is this needed?

The development allows you to create a speech recognition system for literally every living language on the planet, including the rarest languages ​​of very small peoples. And if the algorithm can translate a language into text on the fly, then it can provide this text with subtitles in another language or simultaneous translation.

In fact, Facebook has almost completely destroyed language barriers. Imagine a world where everyone can understand everyone thanks to a smart gadget with translator software.

Facebook is already gearing up to begin building speech recognition systems for a huge number of languages ​​and dialects around the world.

Related Posts

Oppo Watch 3 – all details revealed

The Oppo company will soon present its new smartwatch, the Oppo Watch 3. The model should receive an LTPO display with LTPO technology, which makes it possible…

What new products will Samsung show on August 10

A live broadcast of the event, where the audience will witness a number of new Samsung gadgets and products, will take place on August 10. What the…

LG Ultra Tab officially presented

The LG company officially presented the new LG Ultra Tab tablet on the South Korean market. The model received the Android 12 operating system, a 10.35-inch IPS…

Samsung Galaxy A23 5G officially presented

Samsung officially presented the new Galaxy A23 5G smartphone. The model received support for operation in fifth-generation networks. The smartphone is available in configurations with 4 GB,…

Nothing will release two models of wireless headphones

Recently, the business debuted the Nothing Ear headphones (1), but now it was revealed that there would be two other kinds of wireless devices. This is according…

The cheapest Redmi smartphone with 5G gets MIUI 13

Redmi Note 9T was the first smartphone of the Xiaomi sub-brand for the international market, supporting 5G networks. It was one of the cheapest options with such…

Leave a Reply

Your email address will not be published.