Kangarroar / diff-svc-GUILinks
Extension program for DIFF-SVC to make it more easy to use
☆17Updated 3 years ago
Alternatives and similar repositories for diff-svc-GUI
Users that are interested in diff-svc-GUI are comparing it to the libraries listed below
Sorting:
- 基于FreeVC的歌声转换☆21Updated 3 years ago
- singing voice conversion without f0☆23Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- 大量の音声データから笑い声部分を集めるやつ☆12Updated last year
- singing voice conversion based on glow-tts☆12Updated 2 years ago
- ☆55Updated 3 years ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆55Updated 2 months ago
- MFA acoustic model training based on Opencpop☆15Updated 3 years ago
- ☆39Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Updated 3 years ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆37Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- A Japanese G2P tool based on pyopenjtalk☆25Updated 3 years ago
- ☆20Updated 3 years ago
- ☆33Updated 3 years ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 5 months ago
- The Original Support for English NNSVS Dataset Creation☆29Updated last year
- ☆19Updated 2 years ago
- An Implementation of Singing Voice Conversion Based on Diffsinger☆73Updated 2 years ago
- DiffSinger training colab notebook to make training easier hopefully☆50Updated 6 months ago
- BigVGAN with Neural Source-Filter☆56Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆35Updated 11 months ago
- Implementation of Emo-StarGAN☆46Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated 2 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year
- ☆14Updated 5 months ago
- Bilingual-TTS (Japanese and Korean)☆32Updated 2 years ago
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Updated last year