sony / sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆185Updated 2 years ago
Alternatives and similar repositories for sqvae:
Users that are interested in sqvae are comparing it to the libraries listed below
- ☆123Updated 11 months ago
- [ICCV 2023] Online Clustered Codebook☆157Updated 4 months ago
- [Neurips 2021]Diffusion Normalizing Flow (DiffFlow)☆117Updated last year
- ☆35Updated last year
- Contrastively Disentangled Sequential Variational Audoencoder☆46Updated 3 months ago
- PyTorch implementation of slicing adversarial network (SAN)☆95Updated 7 months ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆142Updated 4 years ago
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"☆73Updated 4 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆82Updated 3 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆104Updated last year
- ☆304Updated 2 years ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆226Updated 2 years ago
- ☆29Updated last year
- Speech2Vec Reality Check☆80Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆89Updated 7 months ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆30Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆84Updated 2 years ago
- Official PyTorch implementation of the paper: Flow Matching in Latent Space☆244Updated last week
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆111Updated 2 years ago
- Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising diffusion, in Pytorch☆341Updated last year
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆170Updated 2 years ago
- PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.☆123Updated 2 years ago
- ☆260Updated 3 months ago
- A differentiable argmax function for PyTorch☆43Updated 4 years ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆62Updated 8 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆126Updated 6 months ago
- ☆450Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆84Updated last year
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)☆134Updated 3 years ago