HudsonHuang / waveglow_vocoder
A vocoder that converts audio to a mel-spectrogram and back with WaveGlow, on the GPU.
☆16Updated 4 years ago
Alternatives and similar repositories for waveglow_vocoder:
Users who are interested in waveglow_vocoder are comparing it to the repositories listed below.
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆62Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- A DDSP-based neural voice synthesiser.☆112Updated 2 months ago
- PyTorch implementation of subband decomposition☆92Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆57Updated 2 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- This repo contains code for comparing audio representations in the task of audio synthesis with Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆53Updated 2 years ago
- PyTorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆61Updated 4 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆41Updated 3 months ago
- ☆79Updated last year
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513☆63Updated last year
- ☆64Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 6 months ago
- ☆34Updated 3 years ago
- This code is to run the WARP-Q speech quality metric.☆34Updated 3 months ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆57Updated 2 years ago
- ☆60Updated last year
- WaveNet autoencoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- ☆21Updated 6 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆154Updated 2 years ago
- ☆87Updated 2 years ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆61Updated last year
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated 2 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆38Updated 5 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆51Updated last year
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- Reproducible Subjective Evaluation☆58Updated 10 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆54Updated 6 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆61Updated last month