sony / sqvaeLinks
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆188Updated 2 years ago
Alternatives and similar repositories for sqvae
Users that are interested in sqvae are comparing it to the libraries listed below
Sorting:
- ☆129Updated last year
- [ICCV 2023] Online Clustered Codebook☆171Updated 8 months ago
- [Neurips 2021]Diffusion Normalizing Flow (DiffFlow)☆117Updated last year
- ☆39Updated last year
- PyTorch implementation of slicing adversarial network (SAN)☆99Updated 11 months ago
- Contrastively Disentangled Sequential Variational Audoencoder☆46Updated 7 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆136Updated last year
- PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.☆125Updated 3 years ago
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆61Updated 2 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆142Updated 4 years ago
- PyTorch implementation of diffusion models.☆59Updated 3 years ago
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"☆73Updated 5 years ago
- [NeurIPS 2023] code for "DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models☆68Updated last year
- ☆315Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆56Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆228Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆83Updated 11 months ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆109Updated 3 years ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)☆146Updated 3 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆31Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆57Updated last year
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆172Updated 3 years ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"☆216Updated 4 months ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆84Updated last year
- ☆289Updated 7 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆91Updated 11 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 7 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆127Updated 10 months ago
- Official implementation of "DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents"☆366Updated 2 years ago