[ad_1]
In November last year, the company announced its plans to create a language model supporting 1,000 of the world’s most-spoken languages while also revealing its USM model.
The tech giant describes USM as a family of state-of-the-art speech models with 2 billion parameters trained on 12 million hours of speech and 28 billion sentences of text, spanning 300+ languages.
“USM, which is for use in YouTube (e.g., for closed captions), can perform automatic speech recognition (ASR) not only on widely spoken languages like English and Mandarin, but also on under-resourced languages like Amharic, Cebuano, Assamese, and Azerbaijani to name a few,” Google said in a blogpost.
Google currently claims that USM supports over 100 languages and will serve as the “foundation” for a much larger system.
Discover the stories of your interest
Meanwhile, Google is expected to introduce a host of AI features for its products in the near future, and among them, Gboard for Android is working to integrate the Imagen text-to-image generator.
[ad_2]
Source link