Kangarroar / diff-svc-GUI
Extension program for DIFF-SVC to make it more easy to use
☆16Updated 2 years ago
Alternatives and similar repositories for diff-svc-GUI:
Users that are interested in diff-svc-GUI are comparing it to the libraries listed below
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated last year
- singing voice conversion without f0☆23Updated last year
- DiffSinger training colab notebook to make training easier hopefully☆37Updated last week
- 基于FreeVC的歌声转换☆21Updated 2 years ago
- MFA acoustic model training based on Opencpop☆12Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆20Updated 8 months ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆29Updated 5 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 11 months ago
- ☆28Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆12Updated 11 months ago
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated last year
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆22Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆21Updated 2 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- ☆55Updated 2 years ago
- ☆31Updated 2 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆50Updated 2 years ago
- ☆39Updated last year
- ☆23Updated 5 months ago
- Bilingual-TTS (Japanese and Korean)☆29Updated last year
- The Original Support for English NNSVS Dataset Creation☆26Updated 2 months ago
- ☆19Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆39Updated 2 weeks ago
- ☆24Updated last year
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)☆10Updated 10 months ago
- A minimum inference engine for DiffSinger☆34Updated 9 months ago
- BigVGAN with Neural Source-Filter☆51Updated last year
- Python scripts I made to make NNSVS labeling easier.☆23Updated last year