Posted by Rakesh Iyer, Staff Software Engineer and Leland Rechis, Group Product Manager
We’re significantly upgrading the Speech Services by Google speech engine to deliver clearer, more natural voices. All 421 voices in 67 languages have been upgraded with a new voice model and synthesizer.
If you’re already using TTS with the Speech Services by Google engine, you don’t need to do anything. Everything happens behind the scenes, as your users automatically receive the latest update. We have seen a significant improvement in quality with this change, especially in clarity and naturalness.
With this upgrade we are also changing the default en-US voice to one built from newer speaker data, resulting in a drastic improvement on top of our new stack. If your users have not selected a system voice and you rely on the system’s default settings, they will hear a slightly different speaker. You can hear the difference below:
Speaker change and upgrade for en-US
Example current speaker
Example upgraded speaker
Speaker upgrades in a few other languages
This update will roll out to all 64-bit Android devices via the Google Play Store over the coming weeks as part of the Speech Services by Google APK. If you’re concerned that your users haven’t received it yet, you can check that the com.google.android.tts package has a version code of at least 210390644.
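One way to perform that check is sketched below in Kotlin. The package name and minimum version code come from this post; the function name `hasUpgradedSpeechServices` is hypothetical, and the sketch assumes your project already depends on AndroidX Core for `PackageInfoCompat`. On Android 11 and later, package visibility rules may also require a matching `<queries>` entry in your manifest before the package can be looked up.

```kotlin
import android.content.Context
import android.content.pm.PackageManager
import androidx.core.content.pm.PackageInfoCompat

// Values taken from the post.
private const val TTS_PACKAGE = "com.google.android.tts"
private const val MIN_UPGRADED_VERSION_CODE = 210390644L

/**
 * Returns true if Speech Services by Google is installed at or above the
 * version code that ships the upgraded voices.
 * Hypothetical helper for illustration only.
 */
fun hasUpgradedSpeechServices(context: Context): Boolean {
    return try {
        val info = context.packageManager.getPackageInfo(TTS_PACKAGE, 0)
        // PackageInfoCompat reads the long version code on all API levels.
        PackageInfoCompat.getLongVersionCode(info) >= MIN_UPGRADED_VERSION_CODE
    } catch (e: PackageManager.NameNotFoundException) {
        // The package is not installed, or is not visible to this app.
        false
    }
}
```

Because a missing or invisible package is treated as "not upgraded", a `false` result is best read as "unknown or older" rather than as proof that the old voices are in use.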
If you haven’t used TTS in your projects yet, or haven’t given users a way to choose a voice in your app, it’s simple to experiment with. We’ve included some sample code to get you started.
Here’s an example of how to set up voice synthesis, get a list of voices, and set a specific voice. Finally, we send a simple utterance to the synthesizer.
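A minimal Kotlin sketch of those four steps follows. It is an illustration under stated assumptions, not necessarily the exact sample the authors had in mind: the activity name `TtsDemoActivity`, the utterance text, the utterance ID, and the rule used to pick a voice (the first en-US voice that works offline) are all placeholders. It uses only the platform `android.speech.tts.TextToSpeech` API and assumes API level 21 or higher.

```kotlin
import android.app.Activity
import android.os.Bundle
import android.speech.tts.TextToSpeech
import java.util.Locale

// Hypothetical demo activity; the name and voice-selection rule are assumptions.
class TtsDemoActivity : Activity(), TextToSpeech.OnInitListener {

    private lateinit var tts: TextToSpeech

    override fun onCreate(savedInstanceState: Bundle?) {
        super.onCreate(savedInstanceState)
        // Step 1: set up synthesis, asking explicitly for the
        // Speech Services by Google engine.
        tts = TextToSpeech(this, this, "com.google.android.tts")
    }

    // Called asynchronously once the engine has finished initializing.
    override fun onInit(status: Int) {
        if (status != TextToSpeech.SUCCESS) return

        // Step 2: get the list of voices the engine offers (may be null).
        val voices = tts.voices ?: return

        // Step 3: pick a specific voice. Here: first offline en-US voice.
        val voice = voices.firstOrNull {
            it.locale == Locale.US && !it.isNetworkConnectionRequired
        }
        if (voice != null) {
            tts.setVoice(voice)
        }

        // Step 4: send a simple utterance to the synthesizer.
        tts.speak(
            "Hello from the upgraded voices!",
            TextToSpeech.QUEUE_FLUSH,
            null,             // no extra engine parameters
            "demo-utterance"  // arbitrary utterance ID
        )
    }

    override fun onDestroy() {
        // Release the engine's resources when the activity goes away.
        tts.shutdown()
        super.onDestroy()
    }
}
```

In a real app you would typically show the contents of `tts.voices` in a picker and let the user choose, rather than hard-coding a selection rule. If no matching voice is found, the sketch simply speaks with whatever default voice the engine provides.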
We are excited to see this improved experience in your app!