🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆105Jun 23, 2024Updated last year
Alternatives and similar repositories for vaex
Users that are interested in vaex are comparing it to the libraries listed below.
- ☆20Dec 8, 2024Updated last year
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 10 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆998Nov 25, 2025Updated 3 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,646Nov 10, 2025Updated 4 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆147Jan 23, 2025Updated last year
- ☆14Sep 22, 2025Updated 6 months ago
- This is the official implementation for ControlVAR.☆126Dec 10, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,129Mar 20, 2025Updated last year
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,553Nov 10, 2025Updated 4 months ago
- [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.☆323Jul 9, 2024Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,941Aug 15, 2024Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without a Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- [ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization☆203Dec 18, 2025Updated 3 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,879Feb 20, 2026Updated last month
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Sep 27, 2025Updated 5 months ago
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation☆23Aug 17, 2025Updated 7 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆449Aug 8, 2025Updated 7 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆150Feb 19, 2025Updated last year
- This is a repo to track the latest autoregressive visual generation papers.☆432Jun 25, 2025Updated 8 months ago
- ☆162Apr 1, 2025Updated 11 months ago
- [ICML2025] VARSR: Visual Autoregressive Modeling for Image Super Resolution☆167May 1, 2025Updated 10 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- ☆309May 29, 2025Updated 9 months ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆61Jul 8, 2025Updated 8 months ago
- [ICCV 2023] Online Clustered Codebook☆184Sep 19, 2024Updated last year
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆171May 1, 2025Updated 10 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆153Jul 24, 2025Updated 7 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Mar 25, 2024Updated last year
- Official PyTorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆191Jul 23, 2023Updated 2 years ago
- official repo for `thinking with images through-self-calling`☆25Dec 28, 2025Updated 2 months ago
- ☆10Sep 18, 2020Updated 5 years ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆129Nov 29, 2024Updated last year
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆330Jan 31, 2025Updated last year
- ☆143Jun 28, 2024Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,581Mar 16, 2025Updated last year
- [ICCV2025]Generate one 2K image on single 24GB 3090 GPU!☆84Sep 8, 2025Updated 6 months ago
- 📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. 🎉🎉☆15Mar 30, 2025Updated 11 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆115Oct 7, 2025Updated 5 months ago