philsyn / DiffWave-unconditionalLinks
Pytorch Reimplementation of DiffWave unconditional generation: a high quality waveform synthesizer.
☆43Updated 4 years ago
Alternatives and similar repositories for DiffWave-unconditional
Users that are interested in DiffWave-unconditional are comparing it to the libraries listed below
Sorting:
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆91Updated 4 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆89Updated 2 years ago
- A collection of audio autoencoders, in PyTorch.☆44Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆230Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Updated 2 years ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆32Updated last year
- ☆38Updated last year
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆235Updated 7 months ago
- ☆85Updated 2 years ago
- Diffusion Model for Voice Conversion☆65Updated last year
- Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis☆42Updated 5 years ago
- Conditional Diffusion Probabilistic Model for Speech Enhancement☆246Updated 2 years ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆130Updated 2 years ago
- Implementation of DiffWave and SaShiMi audio generation models☆127Updated 2 years ago
- A fast, high-quality neural vocoder.☆294Updated 2 years ago
- Unofficial download repository for MusicCaps☆48Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆92Updated 5 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆76Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆121Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆110Updated last year
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated 3 weeks ago
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆193Updated 4 months ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆131Updated last month
- DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023☆59Updated 6 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- ☆36Updated 4 years ago
- Official implementation of SawSing (ISMIR'22)☆269Updated 3 years ago
- ☆34Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆68Updated 3 years ago
- An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.☆24Updated last month