HudsonHuang / waveglow_vocoderLinks
A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.
☆16Updated 8 months ago
Alternatives and similar repositories for waveglow_vocoder
Users that are interested in waveglow_vocoder are comparing it to the libraries listed below
Sorting:
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆120Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆65Updated 2 years ago
- ☆84Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- ☆32Updated 2 years ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆63Updated 5 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Updated 4 months ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆55Updated 3 years ago
- A DDSP-based neural voice synthesiser.☆119Updated 11 months ago
- ☆59Updated last year
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- Implementation of DiffWave and SaShiMi audio generation models☆127Updated 2 years ago
- ☆108Updated 2 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- A collection of audio autoencoders, in PyTorch.☆43Updated 2 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Updated 3 years ago
- Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)☆163Updated 4 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆74Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆119Updated 8 months ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆90Updated 4 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513☆64Updated 2 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆44Updated 3 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆119Updated last month
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆160Updated 3 years ago
- ☆64Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆88Updated 4 months ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago