sony / sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆176Updated 2 years ago
Related projects: ⓘ
- PyTorch implementation of slicing adversarial network (SAN)☆82Updated 3 months ago
- [Neurips 2021]Diffusion Normalizing Flow (DiffFlow)☆115Updated last year
- Contrastively Disentangled Sequential Variational Audoencoder☆45Updated last year
- ☆103Updated 6 months ago
- [ICCV 2023] Online Clustered Codebook☆133Updated 9 months ago
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"☆71Updated 4 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆123Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆218Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆76Updated last year
- Official PyTorch implementation of the paper: Flow Matching in Latent Space☆177Updated last month
- ☆29Updated 8 months ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)☆130Updated 2 years ago
- ☆32Updated 3 years ago
- PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.☆120Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆52Updated last year
- A Pytorch Implementation of Finite Scalar Quantization☆68Updated 9 months ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"☆130Updated 2 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆108Updated last year
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆52Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆75Updated 3 months ago
- ☆163Updated last year
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆168Updated 2 years ago
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆155Updated last year
- Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models☆79Updated 3 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆77Updated last year
- ☆290Updated 2 years ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 2 years ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆86Updated 3 years ago
- ☆217Updated 4 months ago
- PyTorch implementation of diffusion models.☆58Updated 2 years ago