philsyn / DiffWave-Vocoder
PyTorch reimplementation of the DiffWave vocoder: a high-quality, fast, and small neural vocoder.
☆86 Updated 3 years ago
Related projects
Alternatives and complementary repositories for DiffWave-Vocoder
- PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer. ☆33 Updated 3 years ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder ☆45 Updated last year
- ☆34 Updated 3 years ago
- An ODE-based generative neural vocoder using Rectified Flow ☆61 Updated last year
- Training code and trained checkpoints for ASGAN. ☆60 Updated 10 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ☆72 Updated 3 years ago
- WaveNet autoencoders for the ZeroSpeech Challenge 2020 ☆36 Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis" ☆188 Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆94 Updated 8 months ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis ☆87 Updated 3 years ago
- ☆79 Updated last year
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion ☆142 Updated 4 years ago
- ☆63 Updated last year
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis ☆67 Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis. ☆87 Updated 2 years ago
- TODO ☆34 Updated last year
- Quasi-Periodic Parallel WaveGAN PyTorch implementation ☆46 Updated 2 years ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆23 Updated 8 months ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis ☆223 Updated 2 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning ☆114 Updated 5 months ago
- ☆46 Updated 4 years ago
- Official implementation of SpeechSplit2 ☆128 Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆56 Updated 3 years ago
- Alignment files of LibriTTS. ☆60 Updated 4 years ago
- A collection of audio autoencoders, in PyTorch. ☆39 Updated last year
- ☆90 Updated 3 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis ☆101 Updated 3 years ago
- ☆36 Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed. ☆203 Updated last month