thestmitsuki / so-vits-svc-rmvpe
only rmvpe
☆22Updated last year
Alternatives and similar repositories for so-vits-svc-rmvpe:
Users that are interested in so-vits-svc-rmvpe are comparing it to the libraries listed below
- dog-can-sing-song☆20Updated 3 months ago
- ☆38Updated 5 months ago
- ☆39Updated last year
- Singing Voice Speech modeling test☆35Updated 2 years ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆23Updated last month
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- Unofficial implementation of NANSY++ in Pytorch Lightning☆51Updated 11 months ago
- ☆44Updated last year
- ☆22Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆63Updated 10 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 8 months ago
- ☆31Updated 2 years ago
- ☆29Updated last year
- Sovits5 with RMVPE☆14Updated last year
- Implementation of StyleTTS for Mandarin☆11Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆16Updated last month
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 7 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆41Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 2 years ago
- ☆64Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- singing voice conversion without f0☆23Updated last year
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated last year
- Reimplementation of Miipher☆20Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems☆83Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆48Updated 2 weeks ago