Tech News

Tech Business News

  • Home
  • Technology
  • Business
  • News
    • Technology News
    • Local Tech News
    • World Tech News
    • General News
    • News Stories
  • Media Releases
    • Tech Media Releases
    • General Media Releases
  • Advertisers
    • Advertiser Content
    • Promoted Content
    • Sponsored Whitepapers
    • Advertising Options
  • Cyber
  • Reports
  • People
  • Science
  • Articles
    • Opinion
    • Digital Marketing
    • Gaming
    • Guest Publishers
  • About
    • Tech Business News
    • News Contributions -Submit
    • Journalist Application
    • Contact Us
Reading: Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)
Share
Font ResizerAa
Tech Business NewsTech Business News
  • Home
  • Technology News
  • Business News
  • News Stories
  • General News
  • World News
  • Media Releases
Search
  • News
    • Technology News
    • Business News
    • Local News
    • News Stories
    • General News
    • World News
    • Global News
  • Media Releases
    • Tech Media Releases
    • General Press
  • Categories
    • Crypto News
    • Cyber
    • Digital Marketing
    • Education
    • Gadgets
    • Technology
    • Guest Publishers
    • IT Security
    • People In Technology
    • Reports
    • Science
    • Software
    • Stock Market
  • Promoted Content
    • Advertisers
    • Promoted
    • Sponsored Whitepapers
  • Contact & About
    • Contact Information
    • About Tech Business News
    • News Contributions & Submissions
Follow US
© 2022 Tech Business News- Australian Technology News. All Rights Reserved.
Tech Business News > World Tech > Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)
World Tech

Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)

The creation of USM is crucial for Google to realise its objective of organising and enhancing worldwide access to information.

Matthew Giannelis
Last updated: March 14, 2023 6:15 am
Matthew Giannelis
Share
SHARE

To keep up with the rising popularity of Open AI, Google has released its latest update and progress report for it’s Universal Speech Model (USM). Presently, the USM can handle roughly 300 languages, but the company has set its sights on expanding its capabilities to 1,000 languages.

The development of Google’s Universal Speech Model (USM) marks a crucial initial step towards building artificial intelligence (AI) capable of comprehending and translating 1,000 languages.

Google released groundbreaking details of their AI universal speech model, specially created to comprehend hundreds of spoken languages. The model’s remarkable capabilities are attributed to its extensive training on an impressive 12 million hours of speech and a staggering 28 billion sentences of text, encompassing over 300 languages.

This game-changing technology is poised to transform the landscape of global communication, and the world waits with bated breath for its next move.

The Universal Speech Model (USM) is engineered with a standardised encoding-decoding architecture that utilizes convolutional, attention, and feed-forward modules for its encoder.

This innovative design makes it compatible with LAS, CTC, and RNN-T decoders, pushing the boundaries of what is possible in the realm of artificial intelligence.

Google is leaving no stone unturned in its pursuit of achieving its lofty 1000-language objective. The tech giant is utilising a powerful machine learning model that focuses on the most frequently used languages. Nonetheless, numerous languages have a minimal number of speakers, which restricts the availability of data

In a bid to overcome this hurdle, Google is taking a multi-pronged approach by aggregating and analysing data from various language sources. The goal is to create more accurate results through automatic speech recognition scaling, laying the foundation for a future where language barriers are no longer an obstacle.

According to recent reports, the data available on YouTube is relatively limited in comparison to that of OpenAI. Nevertheless, the Universal Speech Model (USM) has been found to generate 30% fewer word errors than its OpenAI counterpart.

Google says that early tests of the model were designed to create captions on YouTube videos and can perform automatic speech recognition on 100 languages. Some of the languages are spoken by fewer than twenty million people, making it very difficult to find the training data.

This cutting-edge technology marks a significant step forward in the tech giant’s overarching mission to make information easily accessible to everyone, everywhere. With the USM leading the way, the world can look forward to a future where language barriers are no longer a hindrance to communication and understanding.

The machine learning model’s training pipeline is a multi-step process that starts with self-supervised learning of audio files that span numerous languages. Afterward, the quality and language coverage of the data are optimised by incorporating text data wherever possible.

The next step is the fine-tuning of downstream operations, which leverages supervised data to enhance the model’s performance even further. This comprehensive training regimen is a testament to the tech giant’s unwavering commitment to pushing the boundaries of what is possible in the realm of artificial intelligence.

According to a research paper by Google, which was authored by numerous contributors, there are two types of models generated by pre-trained models that can be fine-tuned for downstream tasks. In addition, there are generic automatic speech recognition (ASR) models, which the researchers believe are not fine-tuned downstream.

Google’s researchers assert that these generic ASR models are scalable and can improve the performance of models trained on shorter utterances when applied to longer speech inputs.

Google has also prioritised improving its speech-to-text transcription accuracy as an important area of focus. Specifically, the company has been developing methods to minimise errors and inaccuracies in transcriptions, which can pose significant challenges in certain situations.

USM now supports all 73 languages in the YouTube Captions’ Test Set and outperforms Whisper on the languages it can support with lower than 40% WER. Lower WER is better.

Also, the Google Artificial Intelligence team has been investigating novel approaches to enhance the precision of its speech models by utilising better training data. These endeavors have involved gathering more varied and inclusive datasets, which can aid in improving the model’s capacity to identify and transcribe speech from diverse sources.

In addition, the algorithm is currently confronting several challenges. Based on research and ongoing competitions, the comprehension algorithm must be versatile, robust, and adaptable to facilitate model improvement in a computationally efficient manner, while expanding language coverage and proficiency.

The algorithm should have the ability to process large volumes of data from various sources, generalize to new languages and use cases, and facilitate model upgrades without requiring extensive retraining.

Undoubtedly, there are apprehensions and concerns about the potential misuse or abuse of these technologies. For instance, concerns have been raised regarding the reliability of speech recognition technologies when utilised in legal proceedings or when transcribing discussions that involve delicate or confidential information.

In general, Google’s initiatives in this domain are part of a larger movement aimed at enhancing AI-driven speech recognition and transcription. As voice-activated interfaces become more prevalent, these technologies will assume an increasingly vital function in a wide range of applications, from virtual assistants to customer service chatbots and beyond.

Despite the aforementioned concerns, it is evident that AI-driven speech recognition and transcription will persist as a significant focus for companies like Google in the coming years.

As these technologies continue to advance, they are likely to proliferate and become even more robust, potentially revolutionising our interactions with computers and each other.

Google published a blog entry by members of the team working on the project. It’s team members also published a paper describing the introduction of its Universal Speech Model (USM) on the arXiv pre-print server.

ByMatthew Giannelis
Follow:
Secondary editor and executive officer at Tech Business News. An IT support engineer for 20 years he's also an advocate for cyber security and anti-spam laws.
Previous Article Customers Dont Like Or Hate Chatbots. Annoying Why Customers Don’t Like Or Hate Chatbots – Annoying & Bad For Business
Next Article China Set To Increase Defense Spending By 7.2% China Set To Increase Defense Spending By 7.2% To 1.56 Trillion Yuan ($230 billion)
Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

Google Universal Speech Model (USM) Progress Report Update

Tech Articles

AI Is Forcing Developers To Abandon Untyped Code

Why AI Is Forcing Developers To Abandon Untyped Code

AI has made ambiguity a liability, with developers spending over…

January 13, 2026

How the World’s Data Centres Are Quietly Burning the Planet

Data centres are burning the planet, with a growing environmental…

March 11, 2026
Australia's Heavy Vehicle EV Charging Market

Australia’s Heavy Vehicle EV Charging Market: A Critical Infrastructure Gap Being Filled

Australia’s heavy EV market is accelerating, but charging is the…

February 15, 2026

Recent News

Aarrest of 39-year-old Pavel Durov last Saturday
World Tech

Telegram Boss Pavel Durov Arrest Fires A Warning Shot To Global Tech Titans

8 Min Read
Telsa In Car Gameplay Distraction
World Tech

US safety body probe Tesla on in-car gameplay

6 Min Read
The ‘996’ Work Culture That’s Causing in China
World Tech

China’s Relentless Work Culture Is Pushing Employees To Their Limits

2 Min Read
World Tech

Windows 11 leak reviles New UI, Deskop and Windows Store

4 Min Read
Tech News

Tech Business News

In 2026, technology news is shaping business outcomes faster than ever—driven by AI adoption, rising cyber risk, cloud modernisation, data regulation, and constant platform change.


Tech News keeps Australian organisations and industry professionals informed with timely reporting and practical coverage across AI, cybersecurity, cloud, enterprise IT, startups, science, people and business, plus major world and local news impacting the tech sector.


Tech Business News publishes news and analysis designed to be clear, relevant, and easy to act on. It supports the industry with technology news reports, whitepaper publishing services, and a range of media, advertising and publishing options 

About

About Us 
Contact Us 
Privacy Policy
Copyright Policy
Terms & Conditions

April, 04, 2026

Contact

Tech Business News
Melbourne, Australia
Werribee 3030
Phone: +61 431401041

Hours : Monday to Friday, 9am 530-pm.

Tech News

© Copyright Tech Business News 

Latest Australian Tech News – 2026

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?