lxa9867 / Awesome-Autoregressive-Visual-Generation
This is a repo to track the latest autoregressive visual generation papers.
☆103 Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Autoregressive-Visual-Generation:
Users who are interested in Awesome-Autoregressive-Visual-Generation are comparing it to the libraries listed below.
- a collection of awesome autoregressive visual generation models ☆63 Updated 2 weeks ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 Updated last month
- XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation ☆178 Updated last month
- The collection of awesome papers on alignment of diffusion models. ☆72 Updated last month
- This is the official implementation for ControlVAR. ☆88 Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆162 Updated 2 months ago
- ☆128 Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆133 Updated 7 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆72 Updated 3 weeks ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality" ☆43 Updated this week
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆222 Updated 2 weeks ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA) ☆97 Updated 2 weeks ago
- ☆47 Updated 2 weeks ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆57 Updated 2 months ago
- Liquid: Language Models are Scalable Multi-modal Generators ☆60 Updated last month
- Implements VAR+CLIP for text-to-image (T2I) generation ☆112 Updated 2 weeks ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆42 Updated 5 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆80 Updated 3 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆23 Updated 2 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆59 Updated this week
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆176 Updated 3 months ago
- Collection of awesome generation acceleration resources. ☆93 Updated this week
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆94 Updated last month
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆91 Updated 6 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆67 Updated last month
- ☆26 Updated 5 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆34 Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 6 months ago
- Benchmark for generative image models ☆72 Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation ☆114 Updated this week