hetpandya / youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
☆35Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for youtube_tts_data_generator
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl …☆158Updated 2 years ago
- ☆165Updated 2 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆190Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆128Updated 11 months ago
- Interface for Controllable Expressive Talking Machine☆38Updated 10 months ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆81Updated last year
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆125Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆144Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆33Updated 3 years ago
- Official implementation of SpeechSplit2☆128Updated 2 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆114Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆111Updated 3 years ago
- ☆77Updated 6 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.☆111Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆81Updated last year
- ☆109Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆87Updated 3 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆111Updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆138Updated last year
- Collect Voice Conversion researches☆90Updated this week
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆26Updated last year
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆119Updated 2 years ago
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆121Updated 3 years ago