bryan051003 / USVG
A unified model for zero-shot singing voice conversion and synthesis
☆21 Updated 2 years ago
Alternatives and similar repositories for USVG
Users who are interested in USVG are comparing it to the libraries listed below.
- ☆15 Updated 4 years ago
- ☆24 Updated 3 years ago
- PyTorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆37 Updated 2 years ago
- Torch implementation of Whisper-guided DDPM-based Voice Conversion ☆49 Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems ☆39 Updated last year
- ☆45 Updated 2 years ago
- Unofficial PyTorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC… ☆15 Updated 2 years ago
- A repo that builds text-to-music datasets from scratch ☆21 Updated 2 weeks ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022 ☆15 Updated 2 years ago
- 60k hours of phoneme-aligned audio from audiobooks ☆18 Updated 10 months ago
- Sequence alignment methods with helpers for PyTorch. ☆24 Updated 2 years ago
- Temporary anonymous version ☆22 Updated last year
- Ultrafast GAN-based Vocoder for Text to Speech ☆50 Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆16 Updated last year
- ☆87 Updated 2 years ago
- PyTorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder" ☆29 Updated 4 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling ☆37 Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆46 Updated 11 months ago
- This is an unofficial implementation of Universal MelGAN according to https://arxiv.org/abs/2011.09631 ☆23 Updated 2 years ago
- ☆26 Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆37 Updated last year
- A toy-like Text-to-Speech for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN. ☆15 Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆28 Updated last year
- ☆23 Updated 11 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 Updated 2 years ago
- A PyTorch implementation of Google GEDLoss ☆32 Updated 4 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning ☆50 Updated last year