polvanrijn / VoiceMe
Repository for the paper: VoiceMe: Personalized voice generation in TTS
☆124Updated 2 years ago
Related projects: ⓘ
- Train the next generation of TTS systems.☆159Updated last week
- Monotonic Alignment Search☆83Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆193Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆117Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆132Updated 11 months ago
- An 16kHz implementation of HiFi-GAN for soft-vc.☆85Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆226Updated 6 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆182Updated 10 months ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆138Updated 2 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.☆184Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆105Updated 2 years ago
- Official Implementation of StyleTTS-VC☆161Updated last year
- ☆69Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆144Updated 2 years ago
- ☆249Updated last year
- ☆110Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆85Updated last year
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆132Updated 4 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆122Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆69Updated 3 years ago
- ☆64Updated last year
- Easy-to-Use Speech MOS predictors☆209Updated 10 months ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆212Updated last month
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆95Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆56Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆134Updated last year
- ☆70Updated last year
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆149Updated 4 months ago