xie-lab-ml / awesome-alignment-of-diffusion-models
The collection of awesome papers on alignment of diffusion models.
☆113 · Updated last week
Alternatives and similar repositories for awesome-alignment-of-diffusion-models:
Users who are interested in awesome-alignment-of-diffusion-models are comparing it to the repositories listed below.
- This is a repo to track the latest autoregressive visual generation papers. ☆143 · Updated this week
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆65 · Updated this week
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆47 · Updated 6 months ago
- a collection of awesome autoregressive visual generation models ☆66 · Updated 3 weeks ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 · Updated 3 weeks ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA) ☆102 · Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 · Updated 7 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆73 · Updated last week
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆84 · Updated 4 months ago
- ☆75 · Updated 2 months ago
- Liquid: Language Models are Scalable Multi-modal Generators ☆65 · Updated 2 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆35 · Updated 9 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆136 · Updated this week
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆85 · Updated 3 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆174 · Updated 3 weeks ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆28 · Updated 3 months ago
- ☆139 · Updated 2 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆25 · Updated last week
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆74 · Updated last year
- fixed official code for paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models". ☆40 · Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation ☆48 · Updated last month
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need ☆183 · Updated 2 months ago
- Open implementation of "RandAR" ☆54 · Updated last month
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation ☆142 · Updated this week
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆176 · Updated 6 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆59 · Updated 3 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆103 · Updated 4 months ago
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆253 · Updated last month