Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.
☆91Apr 13, 2021Updated 4 years ago
Alternatives and similar repositories for DiffWave-Vocoder
Users that are interested in DiffWave-Vocoder are comparing it to the libraries listed below
Sorting:
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆884Mar 26, 2024Updated last year
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 9 months ago
- ☆88Nov 1, 2022Updated 3 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Aug 18, 2021Updated 4 years ago
- A differentiable version of SPTK☆193Feb 3, 2026Updated 3 weeks ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆109Aug 26, 2021Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"☆199Oct 31, 2023Updated 2 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Pytorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.☆43Apr 13, 2021Updated 4 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆408Jul 7, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Implementation of DiffWave and SaShiMi audio generation models☆128Apr 4, 2023Updated 2 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆154Feb 1, 2023Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆230Jul 13, 2022Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Aug 30, 2021Updated 4 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)☆67Dec 28, 2020Updated 5 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago