☆12Nov 7, 2024Updated last year
Alternatives and similar repositories for ddsp-vocoder
Users that are interested in ddsp-vocoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 5 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- ☆82Jan 22, 2025Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- ☆55Jul 16, 2025Updated 8 months ago
- Streaming Vocos☆30Jun 10, 2025Updated 9 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- a lightweight voice conversion☆86Feb 25, 2026Updated 3 weeks ago
- ☆13Sep 12, 2024Updated last year
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆45Jan 29, 2026Updated last month
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Real-time end-to-end singing voice convertion☆24Nov 3, 2024Updated last year
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- Pitch Controllable DDSP Vocoders☆79Nov 9, 2024Updated last year
- ☆32Oct 23, 2025Updated 5 months ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆123Mar 27, 2025Updated 11 months ago
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.☆31Jan 7, 2025Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Jul 14, 2024Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- ☆32Nov 24, 2024Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 9 months ago
- Solving Inverse Problems with Diffusion Optimal Control [NeurIPS 2024]☆19Dec 21, 2024Updated last year