yjzxkxdn / Mini-DDSPView external linksLinks
☆15Mar 31, 2025Updated 10 months ago
Alternatives and similar repositories for Mini-DDSP
Users that are interested in Mini-DDSP are comparing it to the libraries listed below
Sorting:
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Hubert-based Forced Aligner☆30Jan 20, 2026Updated 3 weeks ago
- ONNX deployment of the CREPE pitch tracker☆26Oct 27, 2022Updated 3 years ago
- Public female English corpus used for Project AI❤dol☆14Dec 28, 2025Updated last month
- ☆14Feb 2, 2026Updated last week
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆11Nov 2, 2024Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 5 months ago
- ☆15Nov 11, 2024Updated last year
- Pitch Controllable DDSP Vocoders☆78Nov 9, 2024Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆55Nov 10, 2025Updated 3 months ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24May 28, 2024Updated last year
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- [RecurrentNN × Regression × Regularized]-base Mouth Opening Estimation via SSL(Semi-supervised Learning).☆21Dec 6, 2025Updated 2 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated last week
- ☆19Sep 20, 2024Updated last year
- ☆47Aug 31, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated last week
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- ☆15Aug 22, 2025Updated 5 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆23Nov 12, 2025Updated 3 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- ☆19May 2, 2024Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- SOFA: Singing-Oriented Forced Aligner☆207May 16, 2025Updated 8 months ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆154Feb 6, 2025Updated last year
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆67Nov 1, 2024Updated last year