Ticker

6/recent/ticker-posts

Ad Code

Responsive Advertisement

Google Cloud updates T2S [ text to speech ]

Google Cloud on Thursday announced it's updating its T 2 S products with more voice and greater languages. Google has also stepped forward the 1 rate of its Speech-to-Text transcription tools and is bringing a number of their capabilities into general availability. The updates must assist developers build intelligent voice programs which can reach millions of greater human beings and characteristic extra correctly.

For Text-to-Speech, Google has kind of doubled the variety of voices to be had for the reason that its final update in August. It's added support for seven new languages or editions, consisting of Danish, Portuguese/Portugal, Russian, Polish, Slovakian, Ukrainian and Norwegian BokmÃ¥l -- all in beta. The product now helps a complete of  twenty one languages.



Across those new languages, Google has delivered thirty one new WaveNet voices and 24 new trendy voices. Google says it now helps a complete of one hundred 6 voices.

WaveNet is a deep neural network for producing raw audio, which creates voices that are greater herbal-sounding than popular text-to-speech voices. The generation become created via DeepMind, the AI company Google received in 2 thousand 14
"Thanks to unique get right of entry to to WaveNet technology powered through Google Cloud TPUs, we are able to construct new voices and languages quicker and less difficult than is regular in the industry," Google product manager Dan Aharon said in a blog post.

Google's number one competition for Text-to-Speech offerings is Amazon Web Services' Polly, which according to its internet site presently allows fifty eight voices.

In addition to including new voices, Google's Text-to-Speech Device Profiles function is now typically to be had. This shall we customers optimize audio playback on distinctive kinds of hardware, which includes headphones for media programs like podcasts.
Meanwhile, for Speech-to-Text, Google is bringing into popular availability top class fashions for video and more desirable smartphone, which have been rolled out in beta ultimate year. The video model, which is primarily based on generation similar to what YouTube uses for automated captioning, now has 64 percentage fewer transcription mistakes, Google introduced. The greater cellphone model now has sixty two percentage fewer mistakes.

Google changed into able to advanced the models via requiring clients who used the premium services to proportion usage facts thru statistics logging. Starting now, customers can use the enhanced smartphone version without opting into records sharing, while folks that decide in can pay a decrease charge. Prices are also decrease for all top rate video model customers, and people who choose into facts sharing gets an extra discount

Post a Comment

0 Comments