Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

So the history prompts are collections of text/audio pairs?


history is semantic, coarse and fine. so essentially the same thing thats getting generated just using it as an input before the generation


So how do you clone an existing speaker's voice? That's the part I don't get.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: