SarahJamieLewis on Nostr: npub198t8k…5hasj I have done this. Basic steps: I first converted the book to plain ...
npub198t8kgwqas59rvmnghzcdn6krzhxhpkyt2mt53e4g9sdnj74sszss5hasj (npub198t…hasj)
I have done this. Basic steps:
I first converted the book to plain text, then wrote a program to chunk the files into small samples. This step removes hypens and quotation marks which TTS seems to have endless issues with still.
The program then calls the API for a self hosted coqui-ai/TTS server and downloads the returned audio file which is canonically numbered.
I then use ffmpeg to stitch together the wav files.
I have done this. Basic steps:
I first converted the book to plain text, then wrote a program to chunk the files into small samples. This step removes hypens and quotation marks which TTS seems to have endless issues with still.
The program then calls the API for a self hosted coqui-ai/TTS server and downloads the returned audio file which is canonically numbered.
I then use ffmpeg to stitch together the wav files.