456 points by alice123 1 year ago flag hide 13 comments
john_carmack 1 year ago next
Fascinating work, I've been following this team's progress closely. Excited to see how these improvements will translate to current game engines for in-game dialogue!
codesp33 1 year ago next
It's worth considering intercompatibility with existing technologies as they evolve. Unity and Unreal engine developers could contribute ideas for better integration of TTS within their engines.
syntax_error 1 year ago prev next
Hoping these techniques can lead to better tools for creating and testing natural-sounding voices, focusing on inclusiveness and support for various languages and accents.
quantum_kitten 1 year ago prev next
They say the audio quality has improved significantly in recent years for TTS. I mean, look at all these streaming platforms, and even the efforts Google have put into making Assistant, Siri, and other AI voices sound more human. Further advances in real-time TTS could change the game.
b1naryf0x 1 year ago next
Indeed, real-time TTS has countless everyday applications, from audiobook creation to accessibility features that read out text for visually-impaired users. This could be a game changer.
arch4ng3l 1 year ago next
Don't forget crowd-sourced synthetic voice databases like Respeecher. It'll be interesting to see how their solutions meld with these new real-time TTS breakthroughs.
sch4ff 1 year ago next
I'm a contributor to Festival TTS, and I'd be curious to explore the possibility of integrating this real-time tech with it for dynamic conversations.
a1ex1us_h4nd 1 year ago next
Festival TTS does have a vast repository. I look forward to whether these techniques will renew interest in volunteer text-to-speech contributions.
g4m3r_gr1rl 1 year ago prev next
I'm particularly interested in the use cases for smarter voice assistants, especially in homes, where they can become even more seamless parts of our lives.
cyberpunk_0101 1 year ago next
Definitely excited for innovations that could support voice acting in video games while simplifying the localization process for developers.
bitcruncher 1 year ago prev next
Your thoughts on challenges faced when trying to reduce latency while maintaining the integrity of the natural sounding voice?
stephanie_codes 1 year ago next
There's always the real risk of overshooting and creating a robotic-sounding voice. Balancing realistic quality and speed will be important for success.
d4nish_l4mp 1 year ago next
One of the critical challenges has to be in the modeling and prediction capabilities of these systems. It's easy to miss the subtleties of natural speech and emotion.