Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I quite like IndexTTS2 personally, it does voice cloning and also lets you modulate emotion manually through emotion vectors which I've found quite a powerful tool. It's not necessarily something everyone needs, but it's really cool technology in my opinion.

It's been particularly useful for a model orchestration project I've been working on. I have an external emotion classification model driving both the LLM's persona and the TTS output so it stays relatively consistent. The affect system also influences which memories are retrieved; it's more likely to retrieve 'memories' created in the current affect state. IndexTTS2 was pretty much the only TTS that gives the level of control I felt was necessary.



Wow, the IndexTTS2 demo is very good. Definitely going to check that out. Thanks.

[0] https://indextts2.org




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: