What is SeamlessM4T?

Aug. 23, 2023

Meta, the technology company formerly known as Facebook, recently unveiled an advanced multilingual multimodal AI translation and transcription model named 'SeamlessM4T.'

About SeamlessM4T:

  • SeamlessM4T, which stands for Massively Multilingual and Multimodal Machine Translation, is an advanced multilingual multimodal AI translation and transcription model.
  • It was developed by Meta, the technology company formerly known as Facebook.
  • SeamlessM4T can perform speech recognition as well as speech-to-text, speech-to-speech, text-to-speech, and text-to-text translation.
  • SeamlessM4T supports:
    • Speech recognition for nearly 100 languages;
    • Speech-to-text translation for nearly 100 input and output languages;
    • Speech-to-speech translation, supporting nearly 100 input languages and 36 (including English) output languages;
    • Text-to-text translation for nearly 100 languages;
    • Text-to-speech translation, supporting nearly 100 input languages and 36 (including English) output languages.
  • Other Features:
    • SeamlessM4T brings together diverse spoken data sources to provide a comprehensive multilingual and multimodal translation experience from a single model. 
    • It performs the entire translation task in one go, unlike other large translation models that divide translation across different systems. 
    • It can recognise code-switching, that is, when a speaker moves between two or more languages in one sentence.
    • It also takes gender bias into account: the model can quantify gender bias in translations.
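
The five supported tasks listed above differ only in their input modality, their output modality, and whether the language changes. The minimal Python sketch below lays them out as a lookup table to make that structure visible. The names `Task`, `TASKS`, and `pick_task` are hypothetical and chosen for illustration; they are not part of Meta's released code or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """One of the five tasks named in the article (labels are illustrative)."""
    name: str
    source: str       # input modality: "speech" or "text"
    target: str       # output modality: "speech" or "text"
    translates: bool  # True if the output language differs from the input


# Hypothetical table built only from the task list in the article.
TASKS = [
    Task("speech recognition", "speech", "text", False),
    Task("speech-to-text translation", "speech", "text", True),
    Task("speech-to-speech translation", "speech", "speech", True),
    Task("text-to-text translation", "text", "text", True),
    Task("text-to-speech translation", "text", "speech", True),
]


def pick_task(source: str, target: str, same_language: bool = False) -> Task:
    """Return the task matching the requested modalities.

    Raises ValueError when the combination is not in the article's list
    (for example text to text in the same language).
    """
    for task in TASKS:
        if (task.source, task.target, task.translates) == (
            source, target, not same_language
        ):
            return task
    raise ValueError(f"no listed task for {source} -> {target}")


if __name__ == "__main__":
    print(pick_task("speech", "text", same_language=True).name)
    print(pick_task("text", "speech").name)
```

In a single-model design such as the one described above, every row of this table would be served by the same model; the table only describes which kind of request is being made.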