
Can you create an application that processes Lex Fridman's podcasts and removes his voice?


Maybe reading the transcripts of the podcasts would serve your purpose?


That approach could work, but it may make it more difficult to consume content while exercising.


Podcast -> Transcription -> Text to speech.


Hmm, I wonder if anyone has a simple pipeline for extracting data for "voice cloning" type models from the combination of original audio and transcribed text. It should be possible to chain this with some post-processing to replace Lex's voice with something more pleasing, and maybe throw in some automated rewriting of the transcript to remove the fluff.
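
A rough sketch of the data-extraction step, assuming the transcript has already been diarized into speaker-labelled, timestamped segments. The `Segment` fields, the `training_pairs` helper, and the "host"/"guest" labels are made up for illustration and do not come from any particular tool. It pairs each run of one speaker's speech with its time range in the original audio, which is roughly the (clip, text) shape voice-cloning data tends to need.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    speaker: str  # label from a diarization step (assumed to exist already)
    start: float  # seconds into the original audio
    end: float
    text: str


def training_pairs(segments, speaker, min_seconds=1.0):
    """Return (start, end, text) clips for one speaker.

    Back-to-back segments from the same speaker are merged into one clip,
    and clips shorter than min_seconds are dropped afterwards.
    """
    pairs = []
    prev_speaker = None
    for seg in segments:
        if seg.speaker == speaker:
            if prev_speaker == speaker:
                # Extend the previous clip instead of starting a new one.
                start, _, text = pairs[-1]
                pairs[-1] = (start, seg.end, text + " " + seg.text)
            else:
                pairs.append((seg.start, seg.end, seg.text))
        prev_speaker = seg.speaker
    return [p for p in pairs if p[1] - p[0] >= min_seconds]


segments = [
    Segment("host", 0.0, 2.0, "Welcome."),
    Segment("guest", 2.0, 5.0, "Thanks."),
    Segment("guest", 5.0, 9.0, "Glad to be here."),
    Segment("host", 9.0, 9.5, "Yeah."),
    Segment("guest", 9.5, 12.0, "So."),
]
guest_clips = training_pairs(segments, "guest")
```

Running the same function with the other label gives the time ranges you would cut out or re-synthesize; the actual audio slicing and the speech model are left out here.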


Excellent flowchart! Thank you



