revsic / tf-diffwave
Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis
☆40Updated 4 years ago
Alternatives and similar repositories for tf-diffwave:
Users that are interested in tf-diffwave are comparing it to the libraries listed below
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆61Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆112Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆155Updated 2 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆78Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- ☆62Updated 9 months ago
- Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a…☆38Updated 2 years ago
- ☆79Updated last year
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆47Updated last year
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆144Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- Fast and differentiable time domain all-pole filter in PyTorch.☆54Updated this week
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- Reproducible Subjective Evaluation☆58Updated 10 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆98Updated 7 months ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆72Updated 2 weeks ago
- ☆91Updated 3 years ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆87Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆208Updated 2 weeks ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆54Updated 2 months ago
- ☆87Updated 2 years ago
- ☆43Updated 7 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆35Updated 6 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆51Updated 10 months ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆51Updated 2 years ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35Updated last year