hustvl / ControlAR
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
☆197Updated last month
Alternatives and similar repositories for ControlAR:
Users that are interested in ControlAR are comparing it to the libraries listed below
- [ICLR2025]☆137Updated last month
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆107Updated 3 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆394Updated this week
- [CVPR 2025] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆277Updated this week
- This is the official implementation for ControlVAR.☆95Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆123Updated last month
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆135Updated 2 months ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆124Updated this week
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆139Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆137Updated last week
- [CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆115Updated 2 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆247Updated this week
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆292Updated last month
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆221Updated last week
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆84Updated 6 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated 11 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆120Updated this week
- ☆76Updated 9 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR☆206Updated 2 weeks ago
- Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆55Updated last week
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆266Updated 2 months ago
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆22Updated this week
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o…☆76Updated 10 months ago
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation☆278Updated 10 months ago
- [CVPR'25] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆124Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 7 months ago
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…☆201Updated last month
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆65Updated this week
- A collection of vision foundation models unifying understanding and generation.☆42Updated 2 months ago