unixpickle / vq-voice-swapLinks
Voice swapping with VQ-VAE and diffusion models
☆68Updated 4 years ago
Alternatives and similar repositories for vq-voice-swap
Users that are interested in vq-voice-swap are comparing it to the libraries listed below
Sorting:
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- ☆20Updated 4 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Contrastive Language-Audio Pretraining☆88Updated 3 years ago
- Trainer for audio-diffusion-pytorch☆130Updated 2 years ago
- ☆87Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 4 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆89Updated last year
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- ☆28Updated 4 years ago
- ☆64Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 3 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆111Updated 3 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆93Updated 2 years ago
- ☆22Updated 3 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆40Updated 4 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 3 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- ☆23Updated 2 years ago
- Pytorch Implementation of WaveNODE☆64Updated 5 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆68Updated 4 years ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆60Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Updated 3 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 4 years ago