Google’s new AI can help you speak another language in your own voice

Google Translate is one of the company’s most used products. It helps people translate one language to another by typing, photographing text, or using speech-to-text technology. Now, the company is launching a new project called Translatotron, which will offer direct speech-to-speech translation without using any intermediate text.

In a post on Google’s AI blog, the team behind the tool explained that instead of using speech-to-text and then text-to-speech to convert voice, the new system relies on a single neural network model that translates speech directly into speech.

“Dubbed Translatotron, this system avoids dividing the task into separate stages, providing a few advantages over cascaded systems, including faster inference speed, naturally avoiding compounding errors between recognition and translation, making it straightforward to retain the voice of the original speaker after translation, and better handling of words that do not need to be translated (e.g., names and proper nouns),” the Google research team wrote in the blog post.
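To see why errors compound in a cascaded system, here is a minimal back-of-the-envelope sketch. The per-stage success rates are made-up numbers, the function name is hypothetical, and the assumption that stage errors are independent is a simplification. It says nothing about Translatotron's actual accuracy; it only shows how chaining stages multiplies the chance of a mistake.

```python
from typing import List


def cascade_success_rate(stage_rates: List[float]) -> float:
    """Chance that every stage in a pipeline succeeds.

    Assumes each stage fails independently of the others,
    which is a simplification made for illustration.
    """
    rate = 1.0
    for stage_rate in stage_rates:
        rate *= stage_rate
    return rate


# Hypothetical rates for recognition, translation, and speech synthesis.
illustrative_rates = [0.95, 0.95, 0.95]
print(round(cascade_success_rate(illustrative_rates), 3))
```

With three stages that are each right 95% of the time, the whole chain is right only about 86% of the time under these assumptions. A single-stage model avoids that multiplication, though it still has to be accurate in its own right.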

Translatotron can also preserve the characteristics of the voice of the speaker when translating from one language to another. This could be really useful to sound editors who dub movies and TV shows.

The researchers have admitted that translations from the new model are not as precise as those from traditional cascaded systems, but they’re confident the accuracy of the new model will soon improve.

There are plenty of apps out there like iTranslate and SayHi that try to translate one language to another using voice. But they’re still not as smooth and error-free as one would like.

Considering this is still a model (and there’s not even a demo available yet), chances are Google will take a while to implement the new system in consumer-grade solutions. I, for one, am looking forward to trying it out though.

You can read more about Google’s new technology here, and you can read more about the model the research team used here.

Story by Ivan Mehta

Ivan covers Big Tech, India, policy, AI, security, platforms, and apps for TNW. That's one heck of a mixed bag. He likes to say "Bleh."
