HudsonHuang / waveglow_vocoderLinks
A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.
☆16Updated 10 months ago
Alternatives and similar repositories for waveglow_vocoder
Users that are interested in waveglow_vocoder are comparing it to the libraries listed below
Sorting:
- ☆64Updated 2 years ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆64Updated 5 years ago
- PyTorch Dataset for Speech and Music audio☆79Updated last year
- Implementation of DiffWave and SaShiMi audio generation models☆127Updated 2 years ago
- ☆85Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆69Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Updated 2 years ago
- A DDSP-based neural voice synthesiser.☆124Updated last year
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- A collection of audio autoencoders, in PyTorch.☆44Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Updated 4 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆38Updated 3 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Updated 6 months ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆91Updated 4 years ago
- ☆32Updated 2 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆77Updated 2 years ago
- Variational auto-encoders for audio☆127Updated 5 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- ☆60Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆191Updated 3 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆162Updated 3 years ago
- ☆90Updated 4 years ago
- Yin pitch estimator in PyTorch☆117Updated 3 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆55Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated 2 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Updated last year