praeclarumjj3 / VQ-VAE-on-MNISTLinks
VQ-VAE implementation in Pytorch
☆24Updated 5 years ago
Alternatives and similar repositories for VQ-VAE-on-MNIST
Users that are interested in VQ-VAE-on-MNIST are comparing it to the libraries listed below
Sorting:
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆31Updated last year
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆84Updated 2 years ago
- An ODE-based generative neural vocoder using Rectified Flow☆59Updated 2 years ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆66Updated last year
- A collection of audio autoencoders, in PyTorch.☆42Updated 2 years ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 3 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆11Updated 9 months ago
- Variational Autoencoder (VAE) with Normalizing Flows☆62Updated 8 months ago
- ☆26Updated 10 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 8 months ago
- ☆29Updated last year
- ☆44Updated 7 months ago
- small audio language model for reasoning☆64Updated 2 months ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 3 years ago
- A PyTorch implementation of Bayesian flow networks (Graves et al., 2023).☆26Updated last year
- ☆83Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆68Updated 2 years ago
- The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)☆77Updated 6 months ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆90Updated 4 years ago
- Official repository of Wavehax vocoder☆52Updated 6 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆11Updated last year
- Educational implementation of the Discrete Flow Matching paper☆92Updated 10 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated 5 months ago
- ☆31Updated 2 years ago
- ☆15Updated 2 years ago
- ☆98Updated last month
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆70Updated 5 months ago