rese1f/Awesome-VQVAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rese1f/Awesome-VQVAE)

rese1f / Awesome-VQVAE

A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application

☆331

Alternatives and similar repositories for Awesome-VQVAE

Users that are interested in Awesome-VQVAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CrossmodalGroup / DynamicVectorQuantization
View on GitHub
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…
☆191Jul 23, 2023Updated 2 years ago
lucidrains / vector-quantize-pytorch
View on GitHub
Vector (and Scalar) Quantization, in Pytorch
☆3,888Mar 30, 2026Updated last week
magic-research / vector_quantization
View on GitHub
[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆22Dec 17, 2024Updated last year
zh460045050 / VQGAN-LC
View on GitHub
☆144Jun 28, 2024Updated last year
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,002Nov 25, 2025Updated 4 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lucidrains / magvit2-pytorch
View on GitHub
Implementation of MagViT2 Tokenizer in Pytorch
☆660Jan 12, 2025Updated last year
sony / sqvae
View on GitHub
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆194Jul 20, 2022Updated 3 years ago
lyndonzheng / CVQ-VAE
View on GitHub
[ICCV 2023] Online Clustered Codebook
☆184Sep 19, 2024Updated last year
rosinality / vq-vae-2-pytorch
View on GitHub
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
☆1,799Feb 15, 2023Updated 3 years ago
FoundationVision / vaex
View on GitHub
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆105Jun 23, 2024Updated last year
chenpk00 / IS2024_stream_decoder_only_asr
View on GitHub
☆15Mar 12, 2024Updated 2 years ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,140Mar 20, 2025Updated last year
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 4 years ago
xie-lab-ml / Meissonic-Inference
View on GitHub
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Nov 21, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
vvvm23 / vqvae-2
View on GitHub
PyTorch implementation of VQ-VAE-2 from "Generating Diverse High-Fidelity Images with VQ-VAE-2"
☆167Feb 15, 2023Updated 3 years ago
MishaLaskin / vqvae
View on GitHub
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
☆889Dec 8, 2022Updated 3 years ago
SerezD / vqvae-vqgan-pytorch-lightning
View on GitHub
VQ-VAE/GAN implementation in pytorch-lightning
☆49Nov 4, 2024Updated last year
FoundationVision / OmniTokenizer
View on GitHub
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
☆322Jul 9, 2024Updated last year
amzn / sparse-vqvae
View on GitHub
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
☆35Oct 27, 2023Updated 2 years ago
showlab / Awesome-Video-Diffusion
View on GitHub
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,563Updated this week
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,941Aug 15, 2024Updated last year
Ephemeral182 / Empirical-Study-of-GPT-4o-Image-Gen
View on GitHub
An Empirical Study of GPT-4o Image Generation Capabilities
☆29Apr 16, 2025Updated 11 months ago
CrossmodalGroup / MaskedVectorQuantization
View on GitHub
Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…
☆66Jul 21, 2023Updated 2 years ago
DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FoundationVision / VAR
View on GitHub
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,668Nov 10, 2025Updated 4 months ago
rese1f / Awesome-DriveLM
View on GitHub
📚 A collection of resources and papers on Large Language Models in autonomous driving
☆27Oct 30, 2023Updated 2 years ago
google-research / magvit
View on GitHub
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆997Jan 17, 2024Updated 2 years ago
cientgu / VQ-Diffusion
View on GitHub
☆486Jun 30, 2022Updated 3 years ago
minyoungg / vqtorch
View on GitHub
☆145Feb 27, 2024Updated 2 years ago
Rem105-210 / DiffFashion
View on GitHub
☆81Mar 15, 2023Updated 3 years ago
LAION-AI / laion50BU
View on GitHub
Un-*** 50 billions multimodality dataset
☆24Sep 14, 2022Updated 3 years ago
lucidrains / rvq-vae-gpt
View on GitHub
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆90Oct 11, 2024Updated last year
rese1f / PoseDA
View on GitHub
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24Aug 26, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
ChenHsing / Awesome-Video-Diffusion-Models
View on GitHub
[CSUR] A Survey on Video Diffusion Models
☆2,287Mar 14, 2026Updated 3 weeks ago
voidful / Codec-SUPERB
View on GitHub
Audio Codec Speech processing Universal PERformance Benchmark
☆301Apr 1, 2026Updated last week
rese1f / STEVE
View on GitHub
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41Dec 27, 2023Updated 2 years ago
kakaobrain / hqtransformer
View on GitHub
Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)
☆29Feb 14, 2024Updated 2 years ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,479May 31, 2024Updated last year
neu-vi / FleVRS
View on GitHub
FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024
☆22Dec 9, 2024Updated last year
0nutation / USLM
View on GitHub
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
☆151Sep 14, 2023Updated 2 years ago