gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆44Updated last year
Alternatives and similar repositories for bark-data-gen:
Users that are interested in bark-data-gen are comparing it to the libraries listed below
- Zero-Shot Emotion Style Transfer☆43Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- Community framework for training tortoise☆41Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- ☆33Updated last year
- ☆69Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Official Implementation of StyleTTS-VC☆177Updated 2 months ago
- Finetuning VITS Efficiently☆32Updated last year
- ☆30Updated 2 years ago
- ☆71Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆87Updated last year
- ☆29Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆122Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 10 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- All generative model in one for better TTS model☆66Updated 7 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆148Updated 2 years ago
- ☆26Updated last year
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆74Updated 11 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year