As part of its broader effort to remove language barriers and keep people connected, Meta has developed a multilingual base model that can understand nearly 100 languages from speech or text and produce translations into other languages in real time.
This multimodal technology, called SeamlessM4T, has been publicly released to help researchers develop and deploy universal applications capable of speech-to-speech, speech-to-text, text-to-speech, and text-to-text translation. It is accompanied by SeamlessAlign, a multimodal translation dataset extracted from a total of 265,000 hours of speech and text, which has also been made available.
This represents a significant advance in AI applications in linguistics: it is a single system that can perform multiple speech- and text-related tasks, whereas previous approaches required a different system for each task, such as a dedicated system for speech-to-speech translation.
What can SeamlessM4T do?
As Meta explains, SeamlessM4T can implicitly recognize the source language without a separate language identification model. It recognizes speech and text in nearly 100 languages and can produce text in the same number of languages and speech in 36 languages. More interestingly, SeamlessM4T can recognize when more than one language is mixed within the same sentence and still produce a translation in the requested target language, whereas previous systems required different approaches for each task.
Experiments with BLASER 2.0, a tool for evaluating speech and text translations, show that this model outperforms current state-of-the-art models for speech-to-text translation. In particular, it performed better against background noise and speaker variation, with average improvements of 37% and 48%, respectively.
“SeamlessM4T outperforms previous state-of-the-art competitors and significantly improves translation performance for low- and medium-resource languages,” Meta wrote in a blog post. “In addition, it has maintained strong performance in high-resource languages (such as English).”
If developed further, this model could lead to large-scale global translation systems, allowing people who speak different languages to communicate more effectively.
Notably, Google is also active in this field, having introduced its Universal Speech Model (USM), which can perform automatic speech recognition (ASR) not only for common languages but also for uncommon ones.