wenhaochai/Awesome-VQVAE

A collection of resources and papers on the Vector Quantized Variational Autoencoder (VQ-VAE) and its applications

☆333

Alternatives and similar repositories for Awesome-VQVAE

Users who are interested in Awesome-VQVAE are comparing it to the repositories listed below.

CrossmodalGroup / DynamicVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…
☆194 · Jul 23, 2023 · Updated 2 years ago
lucidrains / vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
☆3,982 · Updated this week
magic-research / vector_quantization
[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆21 · Dec 17, 2024 · Updated last year
zh460045050 / VQGAN-LC
☆145 · Jun 28, 2024 · Updated 2 years ago
lucidrains / magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
☆668 · Jan 12, 2025 · Updated last year
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,017 · Nov 25, 2025 · Updated 7 months ago
sony / sqvae
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆196 · Jul 20, 2022 · Updated 4 years ago
lyndonzheng / CVQ-VAE
[ICCV 2023] Online Clustered Codebook
☆189 · Sep 19, 2024 · Updated last year
rosinality / vq-vae-2-pytorch
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
☆1,803 · Feb 15, 2023 · Updated 3 years ago
FoundationVision / vaex
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆108 · Jun 23, 2024 · Updated 2 years ago
bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆1,166 · Mar 20, 2025 · Updated last year
xie-lab-ml / Meissonic-Inference
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16 · Nov 21, 2024 · Updated last year
rhoposit / multilingual_VQVAE
☆37 · May 8, 2021 · Updated 5 years ago
chenpk00 / IS2024_stream_decoder_only_asr
☆16 · Mar 12, 2024 · Updated 2 years ago
vvvm23 / vqvae-2
PyTorch implementation of VQ-VAE-2 from "Generating Diverse High-Fidelity Images with VQ-VAE-2"
☆168 · Feb 15, 2023 · Updated 3 years ago
SerezD / vqvae-vqgan-pytorch-lightning
VQ-VAE/GAN implementation in pytorch-lightning
☆49 · Nov 4, 2024 · Updated last year
MishaLaskin / vqvae
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
☆898 · Dec 8, 2022 · Updated 3 years ago
wenhaochai / claude-plugins
Personal Claude Code plugin marketplace
☆16 · Updated this week
Owen718 / LongPrompt-LLamaGen
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30 · Oct 21, 2024 · Updated last year
FoundationVision / OmniTokenizer
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
☆325 · Jul 9, 2024 · Updated 2 years ago
amzn / sparse-vqvae
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
☆36 · Oct 27, 2023 · Updated 2 years ago
showlab / Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,724 · Jun 16, 2026 · Updated last month
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 · Aug 15, 2024 · Updated last year
CrossmodalGroup / MaskedVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…
☆68 · Jul 21, 2023 · Updated 3 years ago
FoundationVision / VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,708 · Nov 10, 2025 · Updated 8 months ago
wenhaochai / Awesome-DriveLM
📚 A collection of resources and papers on Large Language Models in autonomous driving
☆27 · Oct 30, 2023 · Updated 2 years ago
yangdongchao / ALMTokenizer
The demo page for ALMTokenizer
☆59 · Apr 14, 2025 · Updated last year
google-research / magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002 · Jan 17, 2024 · Updated 2 years ago
minyoungg / vqtorch
☆145 · Feb 27, 2024 · Updated 2 years ago
Rem105-210 / DiffFashion
☆82 · Mar 15, 2023 · Updated 3 years ago
LAION-AI / laion50BU
Un-*** 50 billions multimodality dataset
☆24 · Sep 14, 2022 · Updated 3 years ago
cientgu / VQ-Diffusion
☆487 · Jun 30, 2022 · Updated 4 years ago
lucidrains / rvq-vae-gpt
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆90 · Oct 11, 2024 · Updated last year
CompVis / taming-transformers
Taming Transformers for High-Resolution Image Synthesis
☆6,520 · Jul 30, 2024 · Updated last year
ChenHsing / Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
☆2,303 · Jun 22, 2026 · Updated 3 weeks ago
wenhaochai / PoseDA
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24 · Aug 26, 2023 · Updated 2 years ago
wenhaochai / STEVE
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41 · Dec 27, 2023 · Updated 2 years ago
voidful / Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
☆308 · Jul 4, 2026 · Updated 2 weeks ago
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,687 · May 31, 2024 · Updated 2 years ago