MLo7Ghinsan / DiffSinger_colab_notebook_MLo7
DiffSinger training colab notebook to make training easier hopefully
☆32Updated last month
Related projects: ⓘ
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆27Updated last month
- The Original Support for English NNSVS Dataset Creation☆23Updated 3 months ago
- Python scripts I made to make NNSVS labeling easier.☆22Updated last year
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆21Updated 2 months ago
- UST入りの歌唱DBからENUNU用モデルを生成するツール☆20Updated last year
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated last year
- SOFA: Singing-Oriented Forced Aligner☆118Updated this week
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆92Updated last week
- ☆33Updated this week
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆22Updated last year
- Pipelines and tools to build your own DiffSinger dataset.☆86Updated 5 months ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆48Updated 11 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆71Updated 2 weeks ago
- ☆27Updated 10 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- ☆54Updated 11 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆19Updated 4 months ago
- Korean language support for NNSVS/ENUNU☆26Updated 5 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆182Updated 10 months ago
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆65Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆124Updated 10 months ago
- Open source voice labeling application☆145Updated 2 weeks ago
- Extension program for DIFF-SVC to make it more easy to use☆17Updated last year
- DiffSinger dataset processing tools, including audio processing, labeling.☆46Updated last week
- Music generation☆24Updated 4 months ago
- ☆92Updated last month
- ☆38Updated 2 weeks ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆28Updated 2 years ago
- Zero-Shot Emotion Style Transfer☆33Updated 5 months ago
- All generative model in one for better TTS model☆64Updated last week