sony / sqvae
PyTorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆190 Updated 2 years ago
Alternatives and similar repositories for sqvae
Users who are interested in sqvae are comparing it to the repositories listed below
- ☆134 Updated last year
- Contrastively Disentangled Sequential Variational Autoencoder ☆46 Updated 9 months ago
- [NeurIPS 2021] Diffusion Normalizing Flow (DiffFlow) ☆117 Updated last year
- PyTorch implementation of slicing adversarial network (SAN) ☆98 Updated last year
- ☆41 Updated last year
- [ICCV 2023] Online Clustered Codebook ☆174 Updated 10 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆87 Updated 9 months ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization" ☆112 Updated 3 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models" ☆56 Updated 2 years ago
- A minimal PyTorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony ☆31 Updated last year
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis ☆229 Updated 3 years ago
- Jax/Flax implementation of Variational-DiffWave. ☆40 Updated 3 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆92 Updated last year
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It" ☆66 Updated last week
- A PyTorch Implementation of Finite Scalar Quantization ☆141 Updated last year
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion ☆142 Updated 4 years ago
- PyTorch implementation of diffusion models. ☆59 Updated 3 years ago
- [ICLR 2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD). ☆161 Updated 2 years ago
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer ☆59 Updated 5 years ago
- PyTorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder. ☆90 Updated 4 years ago
- Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising diffusion, in PyTorch ☆350 Updated last year
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022) ☆61 Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆84 Updated last year
- Speech2Vec Reality Check ☆83 Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022 ☆115 Updated 2 years ago
- PyTorch implementation of D2C: Diffusion-Decoding Models for Few-shot Conditional Generation. ☆125 Updated 3 years ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight) ☆144 Updated 3 years ago
- Implementation of DiffWave and SaShiMi audio generation models ☆125 Updated 2 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in PyTorch ☆99 Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆88 Updated 3 years ago