Very interesting! Is the music an intentional blended track or an artifact of ge...

_josh_meyer_ · on Jan 3, 2022

very much intentional.

Background music makes misuse/abuse less likely (both intentional and unintentional)

Read more here about in our open discussion: https://github.com/coqui-ai/TTS/discussions/1036

Gigachad · on Jan 4, 2022

I appreciate the effort here, but it almost feels like this is hopeless as it seems so many groups are able to build voice synthesis right now that the tech has fallen in to the common persons hand and some of them won't make any effort to stop abuse.

Maybe if we can get watermarked stuff out first and the average person gets up to speed with what tech can do, we can all adjust our expectations before the real wave of abuse hits.

marcan_42 · on Jan 4, 2022

You can probably run the output through Spleeter[1] and get rid of the background music very easily. Just throw more AI at the problem...

It's very hard to curb intentional misuse.

[1] https://github.com/deezer/spleeter