zhanghm1995 / Awesome-VAR
A curated list of resources on Visual AutoRegressive Modeling, which makes GPT-style AR models surpass diffusion transformers in image generation.
☆38Updated 8 months ago
Alternatives and similar repositories for Awesome-VAR
Users who are interested in Awesome-VAR are comparing it to the repositories listed below.
- [ICCV 2025] PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆50Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆200Updated 4 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆47Updated 7 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆95Updated 4 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆312Updated 7 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆146Updated 10 months ago
- [NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think☆193Updated last month
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆175Updated 8 months ago
- This is the official implementation for ControlVAR.☆125Updated 11 months ago
- ☆115Updated 3 months ago
- The first decoder-only multimodal state space model☆97Updated 6 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated 3 weeks ago
- PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆97Updated 7 months ago
- Visual Spatial Tuning☆146Updated 2 weeks ago
- Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"☆13Updated 7 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆162Updated 6 months ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆58Updated 4 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆42Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated last month
- [IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation☆33Updated 2 years ago
- [RA-L] Generate Weather with LLM. Code for "WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segment…☆47Updated 5 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆60Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Updated last year
- ☆53Updated 2 months ago
- A list of works on video generation towards world model☆222Updated this week
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆125Updated last year
- A collection of vision foundation models unifying understanding and generation.☆59Updated 10 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆122Updated 3 weeks ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆159Updated last month
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆39Updated last month