Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Very interesting! Is the music an intentional blended track or an artifact of generation?


very much intentional.

Background music makes misuse/abuse less likely (both intentional and unintentional)

Read more here about in our open discussion: https://github.com/coqui-ai/TTS/discussions/1036


I appreciate the effort here, but it almost feels like this is hopeless as it seems so many groups are able to build voice synthesis right now that the tech has fallen in to the common persons hand and some of them won't make any effort to stop abuse.

Maybe if we can get watermarked stuff out first and the average person gets up to speed with what tech can do, we can all adjust our expectations before the real wave of abuse hits.


You can probably run the output through Spleeter[1] and get rid of the background music very easily. Just throw more AI at the problem...

It's very hard to curb intentional misuse.

[1] https://github.com/deezer/spleeter




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: