bryan051003 / USVG
A unified model for zero-shot singing voice conversion and synthesis
☆21 Updated 2 years ago
Alternatives and similar repositories for USVG:
Users who are interested in USVG are comparing it to the repositories listed below.
- 60k hours of phoneme-aligned audio from audio books ☆18 Updated 5 months ago
- Temporary anonymous version ☆22 Updated 9 months ago
- ☆15 Updated 3 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC… ☆15 Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 Updated 2 years ago
- ☆24 Updated 2 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆44 Updated 6 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 Updated last year
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems. ☆29 Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year
- ☆30 Updated 2 years ago
- ☆16 Updated 2 years ago
- ☆25 Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 Updated last year
- Deep Performer: Score-to-audio music performance synthesis ☆42 Updated last year
- ☆35 Updated 3 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems ☆39 Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated last year
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 Updated 10 months ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch ☆24 Updated 3 years ago
- ☆19 Updated last year
- with alignment learning and continuous wavelet transform ☆20 Updated 2 years ago
- ☆20 Updated 2 years ago
- ☆25 Updated 5 months ago
- ☆19 Updated 9 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆22 Updated 10 months ago
- ☆44 Updated last year