Home / Resources / Blog / Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Google has launched an AI-powered speech synthesis system named Tacotron 2, poised to set a major breakthrough with its human-like articulation ability. Reports from tech analysts state that the new text-to-speech system delivers an AI-generated computer speech, which cannot be easily distinguished from human voice. Google’s AI researchers quote that their model has achieved a MOS (Mean Opinion Score) of 4.53, in comparison to a MOS of 4.58 for professionally recorded speech. The tech giant’s vision shift from “mobile-first” to “AI-first”, announced during the Google I/O 2017 developers conference by Sundar Pichai, is bearing more fruits. Several AI products were launched last year, including Google Lens, Smart Reply for Gmail and Google Assistant for iPhone. Tacotron 2 is the latest addition to this list.

How it Works? The system first creates a spectrogram of the text, which contains a visual representation of how the speech should sound. This image is then fed into Google’s WaveNet algorithm, which brings AI skills closer, in order to mimic human speech. The algorithm has the ability to easily learn different voices and can even generate artificial breaths.

Looking at the capabilities, Tacotron 2 can detect the context and differentiate between two identically-spelled words. For example, it can distinguish between the noun “desert” and the verb “desert” and alter the pronunciation accordingly. Context-driven pronunciation is the highlight of Tacotron 2. The system can understand the sentence type (such as a statement or a question) and adjust the pitch and modulation of the sentence while speaking.

With Tacotron 2, Google is taking one more step towards realizing its “AI-first” dream. In the coming days, we can expect more brilliant AI products from the tech master.

Zerone develops bespoke software solutions carefully customized for the needs of our clients. Contact an expert today

Want to discuss your project?
We can help!

Name*
Business Email*
Phone*

Related blogs

Unlocking The Potential Of Ai In Business Process Outsourcing

#Artificialintelligence

Gold Standard Data: Driving Accuracy In Domain-specific Medical Ai

#Artificialintelligence

Optimizing Llm Costs: A Key To Sustainable Ai Solutions

#Artificialintelligence

Book Your Free DocuSenze for Legal Demo

Enter your details to download the blog

Name*
Business Email*
Phone*

Never Miss a Beat

Join our LinkedIn community for the latest industry trends, expert insights, job opportunities, and more!

Tacotron 2 to Create Ripples in AI-based Text-to-Speech Translation

Unlocking The Potential Of Ai In Business Process Outsourcing

Gold Standard Data: Driving Accuracy In Domain-specific Medical Ai

Optimizing Llm Costs: A Key To Sustainable Ai Solutions

Book Your Free DocuSenze for Legal Demo

We’re glad you’re here. Tell us a little about your requirement.

Thank you!