MiracleDance / CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
☆114Updated 4 months ago
Alternatives and similar repositories for CAR:
Users that are interested in CAR are comparing it to the libraries listed below
- This is the official implementation for ControlVAR.☆102Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆107Updated last month
- ☆146Updated this week
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)☆74Updated last month
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆40Updated this week
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆98Updated 3 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆140Updated 2 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆222Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆135Updated 2 months ago
- Official code for K-LoRA (CVPR 2025)☆99Updated last month
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆103Updated 2 weeks ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆134Updated last week
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆19Updated 3 weeks ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆169Updated this week
- Frequency Autoregressive Image Generation with Continuous Tokens☆56Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆149Updated last month
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆63Updated last week
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆199Updated last week
- Pixel-Space Generative Models☆165Updated last week
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆28Updated last month
- This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"☆106Updated 3 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- ☆29Updated 5 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆23Updated 4 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆75Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆147Updated 2 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆64Updated last week
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆105Updated 3 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆39Updated 2 weeks ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆183Updated last month