I use Tortoise TTS. It's slow, a little clunky, and sometimes the output gets do... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		snerbles on Jan 2, 2024 \| parent \| context \| favorite \| on: OpenVoice: Versatile Instant Voice Cloning I use Tortoise TTS. It's slow, a little clunky, and sometimes the output gets downright weird. But it's the best quality-oriented TTS I've found that I can run locally. It's allegedly the basis of the tech used by Eleven Labs. https://github.com/neonbjb/tortoise-tts

xsdu on Jan 2, 2024 [–]

There are faster implementations of tortoise that allow fine-tuning. You can get close to ElevenLabs quality if you have a perfect dataset. https://git.ecker.tech/mrq/ai-voice-cloning

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact