Hacker Newsnew | past | comments | ask | show | jobs | submitlogin



I thought this would be about text-to-speech applications, while this seems more like an encoder-decoder problem (make the network learn a pattern and then let it reproduce it). I'm wondering how long it is until we see working TTS based on LSTM RNNs.


Yeah, can someone explain the exact problem of "statistical parametric speech synthesis," since I can't find a general overview of the problem itself.


I'm a newbie to all this, but I can imagine it could be useful for speech compression.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: