rese1f / Awesome-VQVAE
A collection of resources and papers on the Vector Quantized Variational Autoencoder (VQ-VAE) and its applications
☆241 · Updated 9 months ago
Alternatives and similar repositories for Awesome-VQVAE:
Users who are interested in Awesome-VQVAE are comparing it to the repositories listed below.
- A Pytorch Implementation of Finite Scalar Quantization · ☆104 · Updated last year
- unofficial MaskGIT reproduction in PyTorch · ☆182 · Updated 11 months ago
- Implementation of Autoregressive Diffusion in Pytorch · ☆343 · Updated 2 months ago
- [ICCV 2023] Online Clustered Codebook · ☆157 · Updated 3 months ago
- Consistency Models Made Easy · ☆253 · Updated 3 months ago
- ☆123 · Updated 10 months ago
- ☆258 · Updated 3 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers · ☆385 · Updated 8 months ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch · ☆295 · Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… · ☆166 · Updated last year
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency" · ☆177 · Updated this week
- Implementation of rectified flow and some of its followup research / improvements in Pytorch · ☆231 · Updated this week
- Code for the paper "Training Diffusion Models with Reinforcement Learning" · ☆380 · Updated last year
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer" · ☆266 · Updated 3 weeks ago
- ☆320 · Updated last month
- ☆449 · Updated 2 years ago
- Implementation of MagViT2 Tokenizer in Pytorch · ☆588 · Updated this week
- Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023). · ☆314 · Updated 8 months ago
- MoVQGAN - model for the image encoding and reconstruction · ☆212 · Updated last year
- A summary of related works about flow matching, stochastic interpolants · ☆371 · Updated 5 months ago
- This repo contains the code for 1D tokenizer and generator · ☆645 · Updated this week
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi… · ☆223 · Updated 8 months ago
- ☆69 · Updated 2 months ago
- ☆211 · Updated last year
- Scalable Diffusion Models with State Space Backbone · ☆149 · Updated 10 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch · ☆285 · Updated this week
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers" · ☆229 · Updated last year
- ☆56 · Updated 3 months ago
- A mini-library for training consistency models. · ☆234 · Updated last year
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". · ☆222 · Updated 2 weeks ago