sony / sqvae
PyTorch implementation of the stochastically quantized variational autoencoder (SQ-VAE)
☆193Updated 3 years ago
Alternatives and similar repositories for sqvae
Users who are interested in sqvae compare it to the libraries listed below
- ☆144Updated last year
- [NeurIPS 2021] Diffusion Normalizing Flow (DiffFlow)☆119Updated 2 years ago
- Contrastively Disentangled Sequential Variational Autoencoder☆48Updated last year
- PyTorch implementation of slicing adversarial network (SAN)☆99Updated last month
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆122Updated 3 years ago
- ☆44Updated 2 years ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 3 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Updated last year
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"☆75Updated 5 years ago
- [ICCV 2023] Online Clustered Codebook☆181Updated last year
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Updated 2 years ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆72Updated 6 months ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Updated 5 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆100Updated last year
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer☆65Updated 6 years ago
- A Pytorch Implementation of Finite Scalar Quantization☆173Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆230Updated 3 years ago
- PyTorch implementations of normalizing flow and its variants.☆79Updated 4 years ago
- PyTorch implementation of diffusion models.☆60Updated 4 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- Speech2Vec Reality Check☆88Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆64Updated 3 years ago
- ☆31Updated 2 years ago
- Code for our tutorial on Discrete Variational Autoencoders☆32Updated 8 months ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)☆153Updated 4 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆89Updated 3 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 3 years ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆91Updated 4 years ago
- A benchmarking suite for disentanglement algorithms, suited for evaluating robustness to correlated factors. Codebase for the paper "Dise…☆77Updated 2 years ago