gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆43Updated last year
Alternatives and similar repositories for bark-data-gen:
Users that are interested in bark-data-gen are comparing it to the libraries listed below
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- Official Implementation of StyleTTS-VC☆175Updated last month
- ☆33Updated last year
- ☆71Updated last year
- Train the next generation of TTS systems.☆162Updated 5 months ago
- Implementation of Emo-StarGAN☆45Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- ☆26Updated 11 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆67Updated last year
- ☆68Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated last year
- audiolm-pytorch training code☆15Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆121Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆79Updated 10 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆46Updated last week
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆29Updated last year
- Finetuning VITS Efficiently☆32Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆136Updated 3 months ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- All generative model in one for better TTS model☆66Updated 5 months ago
- ☆67Updated 3 weeks ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆163Updated this week
- A simple voice conversion tool☆17Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆98Updated 3 weeks ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆146Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year