unixpickle / vq-voice-swap
Voice swapping with VQ-VAE and diffusion models
☆67 Updated 3 years ago
Alternatives and similar repositories for vq-voice-swap:
Users that are interested in vq-voice-swap are comparing it to the libraries listed below
- Demo for 2022 ICASSP ☆64 Updated 2 years ago
- ☆83 Updated last year
- Implementation of NWT, audio-to-video generation, in PyTorch ☆88 Updated 3 years ago
- Trainer for audio-diffusion-pytorch ☆128 Updated 2 years ago
- Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a… ☆38 Updated 2 years ago
- Contrastive Language-Audio Pretraining ☆15 Updated 3 years ago
- The demo page of UniAudio ☆33 Updated last year
- My attempts at applying the SoundStream design to learned tokenization of text, then applying hierarchical attention to text generation ☆83 Updated 5 months ago
- A collection of pre-trained audio models, in PyTorch. ☆113 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆194 Updated last year
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch. ☆85 Updated last year
- ☆28 Updated 3 years ago
- ☆14 Updated 3 years ago
- ☆66 Updated 2 weeks ago
- Implementation of Perceiver AR, DeepMind's long-context attention network based on the Perceiver architecture, in PyTorch ☆86 Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 Updated last year
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356 ☆78 Updated 4 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- Code for Unconditional Audio Generation with GAN and Cycle Regularization ☆75 Updated 3 years ago
- ☆23 Updated last year
- ☆31 Updated 2 years ago
- An unofficial implementation of Universal MelGAN, following https://arxiv.org/abs/2011.09631 ☆23 Updated 2 years ago
- Repo for structured dreaming ☆55 Updated 2 years ago
- GOMIN: Gaudio Open Mel-spectrogram Inversion Network ☆110 Updated last year
- ☆20 Updated 3 years ago
- ☆22 Updated 2 years ago
- PyTorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un… ☆61 Updated 4 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch ☆89 Updated 3 years ago
- Demo for 2022 Interspeech ☆29 Updated 2 years ago