unixpickle / vq-voice-swapLinks
Voice swapping with VQ-VAE and diffusion models
☆67Updated 3 years ago
Alternatives and similar repositories for vq-voice-swap
Users that are interested in vq-voice-swap are comparing it to the libraries listed below
Sorting:
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 3 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- ☆20Updated 3 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆88Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 9 months ago
- Contrastive Language-Audio Pretraining☆87Updated 3 years ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- The demo page of UniAudio☆34Updated last year
- ☆31Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- ☆23Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 4 years ago
- ☆85Updated 2 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Updated 3 years ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- ☆107Updated last year
- Repository of TräumerAI, based on PyTorch implementation of StyleGAN 2☆30Updated 3 years ago
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch☆71Updated 2 years ago
- ☆71Updated 2 weeks ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆20Updated 4 years ago
- Text-writing denoising diffusion (and much more)☆30Updated 2 years ago
- ☆64Updated 3 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆113Updated 3 years ago