Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"
☆21Apr 7, 2021Updated 4 years ago
Alternatives and similar repositories for FastSVC
Users who are interested in FastSVC are comparing it to the repositories listed below
- ☆22Jul 8, 2019Updated 6 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆83Dec 31, 2025Updated 2 months ago
- An open-source Chinese singing voice synthesis dataset.☆24Jul 13, 2019Updated 6 years ago
- TensorFlow and Kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 2 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Code to train and run Blow☆145Sep 4, 2019Updated 6 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 3 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- A Python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default.☆11Mar 14, 2015Updated 10 years ago
- Web-based annotation tool for media data. The easiest way to create your own media dataset.☆16May 12, 2023Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- ☆90Sep 24, 2021Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆151Apr 25, 2025Updated 10 months ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Image Animation with Perturbed Masks☆12Jun 6, 2022Updated 3 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated last year
- Interactive and realtime tools for assisting voicing and singing training☆19Jun 13, 2023Updated 2 years ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 8 months ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆30Sep 16, 2022Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 10 months ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Pulse Model vocoder☆42Dec 5, 2018Updated 7 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆43Oct 30, 2025Updated 4 months ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year