CookiePPP / podcast_rss_feeds
List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.
☆9Updated last year
Alternatives and similar repositories for podcast_rss_feeds:
Users that are interested in podcast_rss_feeds are comparing it to the libraries listed below
- text-to-audio-latent-diffusion☆37Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated last year
- ☆62Updated 6 months ago
- The demo page of UniAudio☆34Updated 11 months ago
- Official Implementation of StyleTTS-VC☆174Updated 2 weeks ago
- ☆28Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆74Updated last month
- Zero-Shot Emotion Style Transfer☆41Updated 9 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 5 months ago
- ☆33Updated 2 months ago
- Monotonic Alignment Search☆88Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 11 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆130Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆98Updated last week
- Codebase and project page for EDMSound☆33Updated last year
- Supervoice diffusion enhance☆26Updated 6 months ago
- Finally, some decent sample sentences☆22Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆29Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 9 months ago
- Audiogen Codec☆130Updated 6 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 3 weeks ago
- Faster Tortoise inference then Tortoise Fast Fork☆126Updated 9 months ago
- A TTS model that makes a speaker speak new languages☆75Updated 7 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆145Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis☆126Updated 2 weeks ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆39Updated 2 weeks ago
- ☆20Updated 2 years ago
- Flexible LoRA Implementation to use with stable-audio-tools☆57Updated 4 months ago