maxin-cn / Awesome-Autoregressive-Visual-Generation-Models
a collection of awesome autoregressive visual generation models
☆39Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Autoregressive-Visual-Generation-Models
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 3 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆48Updated last month
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆84Updated 7 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆72Updated last year
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆46Updated 2 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆40Updated 3 weeks ago
- ICCV2023-Diffusion-Papers☆110Updated last year
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆61Updated 2 months ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆65Updated 8 months ago
- ☆26Updated 3 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆49Updated 2 weeks ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆82Updated 2 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆41Updated last month
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆37Updated last year
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆16Updated 5 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆25Updated 2 months ago
- we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. B…☆50Updated last month
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆46Updated 6 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆21Updated 6 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆48Updated last week
- ☆18Updated 9 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆35Updated 3 weeks ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆34Updated 6 months ago
- ☆30Updated 2 weeks ago
- Code for FineRewards☆19Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆18Updated last week
- ☆119Updated last month
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆109Updated 2 months ago