☆67Aug 16, 2023Updated 2 years ago
Alternatives and similar repositories for diffiner
Users who are interested in diffiner are comparing it to the libraries listed below.
- ☆15Nov 11, 2024Updated last year
- Prosody and Pronunciation Modification Network☆63May 5, 2025Updated 9 months ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆38Feb 1, 2024Updated 2 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Feb 9, 2025Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆111Aug 29, 2024Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning☆50Mar 11, 2024Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- ☆21Jul 15, 2024Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆253Sep 13, 2024Updated last year
- Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"☆88Jun 10, 2024Updated last year
- Conditional Diffusion Probabilistic Model for Speech Enhancement☆250Dec 20, 2022Updated 3 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆124Mar 15, 2024Updated last year
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆155Oct 16, 2023Updated 2 years ago
- Unofficial implementation of miipher☆135Apr 19, 2024Updated last year
- ☆40Jan 24, 2023Updated 3 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆115Jun 23, 2025Updated 8 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- ☆54Mar 2, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- Viterbi decoding in PyTorch☆40Sep 10, 2025Updated 5 months ago
- A minimal PyTorch implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year