spicytigermeat / LabelMakr
A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singing.
☆31Updated 5 months ago
Alternatives and similar repositories for LabelMakr:
Users that are interested in LabelMakr are comparing it to the libraries listed below
- DiffSinger training colab notebook to make training easier hopefully☆38Updated 3 weeks ago
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆23Updated 3 months ago
- Python scripts I made to make NNSVS labeling easier.☆23Updated last year
- The Original Support for English NNSVS Dataset Creation☆26Updated 3 months ago
- Open source voice labeling application☆154Updated 3 months ago
- SOFA: Singing-Oriented Forced Aligner☆148Updated this week
- a CustomTkInter GUI for processing and training DiffSinger models☆15Updated this week
- Pipelines and tools to build your own DiffSinger dataset.☆93Updated 9 months ago
- DiffSinger dataset processing tools, including audio processing, labeling.☆52Updated 3 months ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆106Updated 3 weeks ago
- UST入りの歌唱DBからENUNU用モデルを生成するツール☆21Updated 2 years ago
- ☆62Updated last year
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆21Updated 8 months ago
- A universal converter for singing voice projects which is cross-platform and multi-lingual☆52Updated this week
- ☆123Updated this week
- VITS with phoneme-level prosody modeling based on MaskGIT☆80Updated 5 months ago
- Vocal Remover using Deep Neural Networks☆16Updated last month
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆31Updated 3 weeks ago
- Pitch Controllable DDSP Vocoders☆70Updated 3 months ago
- ☆24Updated last year
- NNSVSのモデルをUTAUで使えるようにするツール (UTAU plugin software powered by NNSVS)☆95Updated 4 months ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆52Updated last year
- ☆38Updated 5 months ago
- Open-source file format designed for high-quality, customizable singing synthesis.☆12Updated last week
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- Line by line audio recording tool for vocal libraries☆28Updated 11 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- A python GUI toolkit for creating/editing Aesthetic YAML dictionaries for OpenUtau☆22Updated 5 months ago
- ☆16Updated 2 months ago