praeclarumjj3 / VQ-VAE-on-MNISTLinks
VQ-VAE implementation in Pytorch
☆27Updated 5 years ago
Alternatives and similar repositories for VQ-VAE-on-MNIST
Users that are interested in VQ-VAE-on-MNIST are comparing it to the libraries listed below
Sorting:
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆88Updated 2 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Updated 2 years ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆67Updated 3 months ago
- Implementation of DiffWave and SaShiMi audio generation models☆127Updated 2 years ago
- Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.☆90Updated 4 years ago
- ☆27Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated last year
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 3 years ago
- Variational Autoencoder (VAE) with Normalizing Flows☆66Updated last year
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆69Updated 3 years ago
- A collection of audio autoencoders, in PyTorch.☆43Updated 2 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆12Updated last year
- JAX Implementations of Descript Audio Codec and EnCodec☆31Updated 7 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆120Updated 2 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆91Updated 2 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆230Updated 3 years ago
- An ODE-based generative neural vocoder using Rectified Flow☆59Updated 2 years ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆192Updated 3 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆122Updated 3 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆95Updated last year
- Audiogen Codec☆143Updated last year
- ☆84Updated 2 years ago
- ☆44Updated 3 years ago
- A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.☆16Updated 8 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆128Updated last year
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆127Updated last year
- ☆30Updated 3 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆132Updated 2 weeks ago
- ☆108Updated 2 months ago