CAR: Controllable AutoRegressive Modeling for Visual Generation
☆128 · Nov 29, 2024 · Updated last year
Alternatives and similar repositories for CAR
Users who are interested in CAR are comparing it to the repositories listed below
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆323 · Apr 24, 2025 · Updated 10 months ago
- This is the official implementation for ControlVAR. ☆125 · Dec 10, 2024 · Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆36 · Feb 11, 2025 · Updated last year
- ☆34 · Dec 29, 2025 · Updated 2 months ago
- ☆10 · Nov 18, 2024 · Updated last year
- Implements VAR+CLIP for text-to-image (T2I) generation ☆147 · Jan 23, 2025 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,936 · Aug 15, 2024 · Updated last year
- [CVPR 2025 Oral] Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,547 · Nov 10, 2025 · Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆150 · Feb 19, 2025 · Updated last year
- ☆15 · May 7, 2024 · Updated last year
- This repo contains the code for 1D tokenizer and generator ☆1,117 · Mar 20, 2025 · Updated 11 months ago
- This is a repo to track the latest autoregressive visual generation papers. ☆431 · Jun 25, 2025 · Updated 8 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing" ☆28 · Apr 15, 2025 · Updated 10 months ago
- Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, aka. video-to-video (V2V) translati… ☆278 · Nov 24, 2025 · Updated 3 months ago
- ☆110 · Jul 9, 2024 · Updated last year
- This repo contains the code for PreciseControl project [ECCV'24] ☆69 · Oct 6, 2024 · Updated last year
- Trying to implement https://arxiv.org/abs/2305.08891 ☆34 · Jun 10, 2023 · Updated 2 years ago
- High-performance Image Tokenizers for VAR and AR ☆303 · Apr 25, 2025 · Updated 10 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆109 · Apr 10, 2024 · Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆996 · Nov 25, 2025 · Updated 3 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆62 · Jan 22, 2025 · Updated last year
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ☆426 · Jun 20, 2025 · Updated 8 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆627 · Oct 29, 2025 · Updated 4 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆321 · Mar 30, 2025 · Updated 11 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis ☆46 · Mar 5, 2024 · Updated last year
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors" ☆11 · Apr 30, 2023 · Updated 2 years ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with… ☆163 · Aug 26, 2025 · Updated 6 months ago
- Video Diffusion State Space Models ☆19 · Mar 27, 2024 · Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆650 · Oct 16, 2024 · Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆438 · Aug 8, 2025 · Updated 6 months ago
- ☆24 · Nov 1, 2024 · Updated last year
- An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts" ☆35 · Nov 21, 2024 · Updated last year
- Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models” ☆14 · Jan 5, 2026 · Updated last month
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation ☆857 · May 23, 2025 · Updated 9 months ago
- [ICLR 2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u… ☆210 · May 5, 2025 · Updated 9 months ago
- [TMLR 2025🔥] A survey for the autoregressive models in vision. ☆787 · Nov 8, 2025 · Updated 3 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT ☆59 · Dec 12, 2024 · Updated last year
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation. ☆116 · Nov 26, 2024 · Updated last year
- Pixel-Space Generative Models ☆301 · May 11, 2025 · Updated 9 months ago