lxa9867 / Awesome-Autoregressive-Visual-Generation
This is a repo to track the latest autoregressive visual generation papers.
☆139 Updated last week
Alternatives and similar repositories for Awesome-Autoregressive-Visual-Generation:
Users who are interested in Awesome-Autoregressive-Visual-Generation are comparing it to the repositories listed below.
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆253 Updated last month
- The collection of awesome papers on alignment of diffusion models. ☆109 Updated last week
- [ICLR25] High-performance Image Tokenizers for VAR and AR ☆200 Updated last week
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆174 Updated 3 weeks ago
- This is the official implementation for ControlVAR. ☆95 Updated 2 months ago
- a collection of awesome autoregressive visual generation models ☆66 Updated 3 weeks ago
- Implements VAR+CLIP for text-to-image (T2I) generation ☆119 Updated 3 weeks ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 Updated 3 weeks ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆89 Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆136 Updated this week
- 📚 Collection of awesome generation acceleration resources. ☆139 Updated this week
- ☆53 Updated 3 weeks ago
- Benchmark for generative image models ☆74 Updated last year
- Liquid: Language Models are Scalable Multi-modal Generators ☆65 Updated 2 months ago
- ☆138 Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆59 Updated 3 months ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA) ☆102 Updated last week
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆65 Updated this week
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆102 Updated 2 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆80 Updated 7 months ago
- [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation ☆234 Updated 2 weeks ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆73 Updated last week
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆45 Updated 6 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆117 Updated last month
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis ☆58 Updated 2 weeks ago
- Scaling Diffusion Transformers with Mixture of Experts ☆259 Updated 5 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆182 Updated 4 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆84 Updated 4 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆120 Updated last month
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality" ☆45 Updated last month