MLo7Ghinsan / DiffSinger_colab_notebook_MLo7
DiffSinger training colab notebook to make training easier hopefully
☆38Updated 2 months ago
Alternatives and similar repositories for DiffSinger_colab_notebook_MLo7:
Users who are interested in DiffSinger_colab_notebook_MLo7 are comparing it to the repositories listed below
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆33Updated 7 months ago
- The Original Support for English NNSVS Dataset Creation☆28Updated 4 months ago
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆23Updated 4 months ago
- Python scripts I made to make NNSVS labeling easier.☆23Updated last year
- a CustomTkInter GUI for processing and training DiffSinger models☆17Updated last week
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆36Updated this week
- SOFA: Singing-Oriented Forced Aligner☆151Updated last month
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆110Updated 2 months ago
- Pipelines and tools to build your own DiffSinger dataset.☆96Updated 11 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆71Updated 4 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆57Updated last month
- ☆64Updated last year
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆53Updated last year
- Vocal Remover using Deep Neural Networks☆17Updated 2 months ago
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Tool for generating ENUNU models from singing databases that include UST files☆21Updated 2 years ago
- ☆38Updated 6 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 6 months ago
- DiffSinger dataset processing tools, including audio processing, labeling.☆54Updated this week
- ☆128Updated last month
- Music generation☆24Updated 10 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆66Updated 8 months ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- ☆24Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆51Updated 2 years ago
- BigVGAN with Neural Source-Filter☆51Updated last year
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆73Updated 5 months ago
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆57Updated 2 months ago