CAR: Controllable AutoRegressive Modeling for Visual Generation
☆129Nov 29, 2024Updated last year
Alternatives and similar repositories for CAR
Users that are interested in CAR are comparing it to the libraries listed below.
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆324Apr 24, 2025Updated 10 months ago
- This is the official implementation for ControlVAR.☆126Dec 10, 2024Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Feb 11, 2025Updated last year
- Implements VAR+CLIP for text-to-image (T2I) generation☆147Jan 23, 2025Updated last year
- ☆34Dec 29, 2025Updated 2 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,941Aug 15, 2024Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,553Nov 10, 2025Updated 4 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆432Jun 25, 2025Updated 8 months ago
- ☆10Nov 18, 2024Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆150Feb 19, 2025Updated last year
- ☆111Jul 9, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,129Mar 20, 2025Updated last year
- ☆15May 7, 2024Updated last year
- Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, a.k.a. video-to-video (V2V) translati…☆280Nov 24, 2025Updated 3 months ago
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 10 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- Trying to implement https://arxiv.org/abs/2305.08891☆34Jun 10, 2023Updated 2 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆998Nov 25, 2025Updated 3 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 2 years ago
- This repo contains the code for PreciseControl project [ECCV'24]☆70Oct 6, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆449Aug 8, 2025Updated 7 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆635Oct 29, 2025Updated 4 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆426Jun 20, 2025Updated 9 months ago
- [ICLR 2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u…☆210May 5, 2025Updated 10 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 10 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆860Updated this week
- [TMLR 2025🔥] A survey for the autoregressive models in vision.☆790Nov 8, 2025Updated 4 months ago
- [CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆109Apr 10, 2024Updated last year
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated 11 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆41Jun 13, 2025Updated 9 months ago
- Official implementation for the paper "BIFRÖST: 3D-Aware Image Compositing with Language Instructions"☆29Dec 24, 2025Updated 2 months ago
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆170Nov 18, 2025Updated 4 months ago
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Sep 27, 2025Updated 5 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,879Feb 20, 2026Updated last month
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated 11 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Sep 1, 2025Updated 6 months ago
- Pixel-Space Generative Models☆305May 11, 2025Updated 10 months ago