CookiePPP / podcast_rss_feeds
List of podcast feeds collected using the iTunes API, and a script to download roughly 6,000,000 hours of English speech.
☆10 Updated last year
Alternatives and similar repositories for podcast_rss_feeds:
Users who are interested in podcast_rss_feeds are comparing it to the repositories listed below.
- text-to-audio-latent-diffusion ☆37 Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 Updated 10 months ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental! ☆43 Updated 2 years ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu… ☆82 Updated 7 months ago
- Official Implementation of StyleTTS-VC ☆177 Updated 2 months ago
- Audiogen Codec ☆133 Updated 9 months ago
- ☆65 Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆148 Updated 2 years ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆84 Updated 3 months ago
- RVC Onnx Infer- Upgraded and simplified-ish ☆21 Updated 11 months ago
- ☆40 Updated 5 months ago
- ☆107 Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated last year
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 7 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆68 Updated 6 months ago
- StyleTTS 2 Optimized Training Fork ☆27 Updated 2 months ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆66 Updated 7 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆101 Updated 2 months ago
- ☆62 Updated 8 months ago
- Supervoice diffusion enhance ☆26 Updated 8 months ago
- fine-tuning MusicGen without prompts to generate music with a specific style ☆62 Updated last year
- Monotonic Alignment Search ☆90 Updated 2 years ago
- ☆84 Updated last year
- Code for Investigating Personalization Methods in Text to Music Generation ☆36 Updated last year
- VoiceBox neural network implementation ☆105 Updated 8 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆15 Updated last year
- Zero-Shot Emotion Style Transfer ☆43 Updated last year
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆64 Updated this week
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network ☆110 Updated last year