philsyn / DiffWave-unconditional
PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.
☆42 · Updated 4 years ago
Alternatives and similar repositories for DiffWave-unconditional
Users who are interested in DiffWave-unconditional are comparing it to the repositories listed below.
- PyTorch reimplementation of DiffWave Vocoder: a high-quality, fast, and small neural vocoder. ☆90 · Updated 4 years ago
- A collection of audio autoencoders, in PyTorch. ☆43 · Updated 2 years ago
- Conditional Diffusion Probabilistic Model for Speech Enhancement ☆239 · Updated 2 years ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆30 · Updated last year
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis ☆229 · Updated 3 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch. ☆86 · Updated 2 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23 ☆117 · Updated 2 years ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023) ☆226 · Updated 2 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆109 · Updated last year
- Diffusion Model for Voice Conversion ☆55 · Updated last year
- Implementation of DiffWave and SaShiMi audio generation models ☆125 · Updated 2 years ago
- A lightweight library for Frechet Audio Distance calculation. ☆286 · Updated 10 months ago
- DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023 ☆56 · Updated 2 months ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation ☆227 · Updated 10 months ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model ☆122 · Updated 8 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆99 · Updated 10 months ago
- A fast, high-quality neural vocoder. ☆288 · Updated 2 years ago
- Variational auto-encoders for audio ☆123 · Updated 5 years ago
- PAM is a no-reference audio quality metric for audio generation tasks ☆66 · Updated 11 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆60 · Updated 2 years ago
- Unofficial download repository for MusicCaps ☆47 · Updated 2 years ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆55 · Updated last week
- Unofficial PyTorch implementation of SoundStream, with training code and a 16 kHz pretrained checkpoint ☆70 · Updated 2 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆108 · Updated 11 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730 ☆129 · Updated last year
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆50 · Updated 11 months ago