SOFA: Singing-Oriented Forced Aligner
☆210May 16, 2025Updated 9 months ago
Alternatives and similar repositories for SOFA
Users that are interested in SOFA are comparing it to the libraries listed below
Sorting:
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆32Jul 30, 2025Updated 7 months ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆39Aug 18, 2024Updated last year
- Hubert-based Forced Aligner☆33Updated this week
- Robust Singing Voice Transcription and MIDI Extraction☆112Nov 20, 2024Updated last year
- Pipelines and tools to build your own DiffSinger dataset.☆130Jan 16, 2026Updated last month
- ☆15Mar 31, 2025Updated 11 months ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆25May 28, 2024Updated last year
- ☆156Feb 6, 2025Updated last year
- ☆14Updated this week
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆214Apr 26, 2024Updated last year
- ☆189Oct 14, 2025Updated 4 months ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆56Nov 10, 2025Updated 3 months ago
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆120Jan 26, 2025Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆151Feb 2, 2026Updated last month
- ☆21Updated this week
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆96Oct 9, 2025Updated 4 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- a CustomTkInter GUI for processing and training DiffSinger models☆38Feb 25, 2026Updated last week
- ☆73Jan 12, 2026Updated last month
- DiffSinger dataset processing tools, including audio processing, labeling.☆69Updated this week
- DiffSinger training colab notebook to make training easier hopefully☆52Jul 24, 2025Updated 7 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆121Mar 27, 2025Updated 11 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 10 months ago
- ☆282Jan 30, 2026Updated last month
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- ☆47Aug 31, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- [ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models☆183Nov 22, 2024Updated last year
- ☆36Sep 6, 2025Updated 6 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆147Mar 23, 2024Updated last year
- Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing…☆350Aug 15, 2025Updated 6 months ago
- A DDSP-based neural voice synthesiser.☆129Nov 14, 2024Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆305Jan 25, 2024Updated 2 years ago