Quote:
Originally Posted by salamanderjuice
There are models that can be annotated with emotions.
Does AI surpass the best human readers? No. But these models are leagues above the old TTS and, honestly, above a lot of people, if my high school English class was remotely typical.
All I want is for them to sound natural and not flub words.
IMO I could listen to a whole book if it sounded like this.
|
There are none that can read text properly without extra contextual markup added by humans. The AI part is all a scam.
It just sounds better in demos. I was invited to try Google's bleeding-edge TTS and used it on a book. I did just one chapter, and while it sounded nice, it was a failure. You could already script intonation, emotion, loudness and phonetic hints in the 1980s.
It's even worse if the text isn't US English. In other news, AI-powered self-service supermarket checkouts train humans to steal, but since they're cheaper than paying the Irish minimum wage, even including the losses, the supermarkets don't care.