CAR: Controllable AutoRegressive Modeling for Visual Generation
☆129Nov 29, 2024Updated last year
Alternatives and similar repositories for CAR
Users that are interested in CAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆324Apr 24, 2025Updated last year
- This is the official implementation for ControlVAR.☆127Dec 10, 2024Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Feb 11, 2025Updated last year
- Implements VAR+CLIP for text-to-image (T2I) generation☆147Jan 23, 2025Updated last year
- ☆34Dec 29, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,948Aug 15, 2024Updated last year
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,564Apr 16, 2026Updated 2 weeks ago
- This is a repo to track the latest autoregressive visual generation papers.☆431Jun 25, 2025Updated 10 months ago
- ☆10Nov 18, 2024Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆150Feb 19, 2025Updated last year
- ☆111Jul 9, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,145Mar 20, 2025Updated last year
- ☆15May 7, 2024Updated last year
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…☆285Apr 8, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- High-performance Image Tokenizers for VAR and AR☆306Apr 25, 2025Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆652Oct 16, 2024Updated last year
- Trying to implement https://arxiv.org/abs/2305.08891☆34Jun 10, 2023Updated 2 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,003Nov 25, 2025Updated 5 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- This repo contains the code for PreciseControl project [ECCV'24]☆71Oct 6, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆459Aug 8, 2025Updated 8 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆645Oct 29, 2025Updated 6 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆429Jun 20, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆210May 5, 2025Updated 11 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 11 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆862Mar 19, 2026Updated last month
- [TMLR 2025🔥] A survey for the autoregressive models in vision.☆798Nov 8, 2025Updated 5 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆110Apr 10, 2024Updated 2 years ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆41Jun 13, 2025Updated 10 months ago
- Official Implementation for paper: BIFRÖST: 3D-Aware Image Compositng with Language Instructions☆29Dec 24, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated 2 years ago
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆172Nov 18, 2025Updated 5 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆109Sep 27, 2025Updated 7 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,904Feb 20, 2026Updated 2 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Sep 1, 2025Updated 8 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆645Oct 16, 2025Updated 6 months ago