qiuqiao / SOFAView external linksLinks
SOFA: Singing-Oriented Forced Aligner
☆207May 16, 2025Updated 8 months ago
Alternatives and similar repositories for SOFA
Users that are interested in SOFA are comparing it to the libraries listed below
Sorting:
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆31Jul 30, 2025Updated 6 months ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆37Aug 18, 2024Updated last year
- Hubert-based Forced Aligner☆30Jan 20, 2026Updated 3 weeks ago
- Robust Singing Voice Transcription and MIDI Extraction☆109Nov 20, 2024Updated last year
- Pipelines and tools to build your own DiffSinger dataset.☆128Jan 16, 2026Updated 3 weeks ago
- ☆15Mar 31, 2025Updated 10 months ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24May 28, 2024Updated last year
- ☆154Feb 6, 2025Updated last year
- ☆14Feb 2, 2026Updated last week
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆211Apr 26, 2024Updated last year
- ☆188Oct 14, 2025Updated 4 months ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆55Nov 10, 2025Updated 3 months ago
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆119Jan 26, 2025Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆150Feb 2, 2026Updated last week
- ☆21Dec 18, 2025Updated last month
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆95Oct 9, 2025Updated 4 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- ☆73Jan 12, 2026Updated last month
- a CustomTkInter GUI for processing and training DiffSinger models☆37Jan 26, 2026Updated 2 weeks ago
- DiffSinger dataset processing tools, including audio processing, labeling.☆69Jan 20, 2026Updated 3 weeks ago
- DiffSinger training colab notebook to make training easier hopefully☆50Jul 24, 2025Updated 6 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆121Mar 27, 2025Updated 10 months ago
- ☆277Jan 30, 2026Updated 2 weeks ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆108Jan 17, 2025Updated last year
- ☆47Aug 31, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 9 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- [ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models☆183Nov 22, 2024Updated last year
- ☆36Sep 6, 2025Updated 5 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆146Mar 23, 2024Updated last year
- A DDSP-based neural voice synthesiser.☆127Nov 14, 2024Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆299Jan 25, 2024Updated 2 years ago
- Audio-FLAN☆160Sep 23, 2025Updated 4 months ago