XueZeyue / Awesome-Visual-Generation-Alignment-Survey
A survey for visual generation alignment
☆119 · Nov 9, 2025 · Updated 3 months ago
Alternatives and similar repositories for Awesome-Visual-Generation-Alignment-Survey
Users who are interested in Awesome-Visual-Generation-Alignment-Survey are comparing it to the repositories listed below
- ☆63 · Jul 10, 2025 · Updated 7 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models · ☆38 · Jan 5, 2026 · Updated last month
- ☆18 · Jul 26, 2024 · Updated last year
- Official Implementation of VideoDPO · ☆160 · Jun 1, 2025 · Updated 8 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback · ☆424 · Sep 24, 2025 · Updated 4 months ago
- ☆85 · Mar 11, 2025 · Updated 11 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT · ☆430 · Sep 18, 2025 · Updated 4 months ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning · ☆16 · Jul 7, 2021 · Updated 4 years ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe… · ☆121 · Jan 29, 2026 · Updated 2 weeks ago
- [CVPR 2025 Highlight] InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment · ☆43 · Jun 29, 2025 · Updated 7 months ago
- Dynamic Surface Function Networks for Clothed Human Bodies · ☆37 · Apr 12, 2021 · Updated 4 years ago
- Build a skeleton using Blender and register it to human mesh. · ☆16 · May 29, 2022 · Updated 3 years ago
- Collection of Acceleration Methods for Generative AI · ☆29 · Dec 9, 2025 · Updated 2 months ago
- SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction (Dataset, Contains proposed top baseline reconstructions with estimate… · ☆21 · Dec 18, 2023 · Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization · ☆52 · Nov 10, 2025 · Updated 3 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection · ☆55 · Aug 16, 2025 · Updated 6 months ago
- [CVPR 2023] Official implementation of "Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU… · ☆20 · Jun 9, 2023 · Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)" · ☆15 · Feb 13, 2023 · Updated 3 years ago
- ☆40 · Dec 16, 2025 · Updated 2 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… · ☆307 · Mar 12, 2025 · Updated 11 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. · ☆85 · May 4, 2025 · Updated 9 months ago
- Code for the CVPR'23 paper: "STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition" · ☆21 · Dec 9, 2024 · Updated last year
- Official code for ICCV2023 InterPrior · ☆20 · Oct 2, 2023 · Updated 2 years ago
- An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation · ☆1,522 · Oct 16, 2025 · Updated 3 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation · ☆23 · Jul 30, 2025 · Updated 6 months ago
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis … · ☆20 · Dec 15, 2024 · Updated last year
- DAGM GCPR 2023 Paper: HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture · ☆27 · Feb 3, 2024 · Updated 2 years ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process · ☆626 · Feb 6, 2026 · Updated last week
- Code for one-stage adaptive set-based HOI detector AS-Net. · ☆52 · May 8, 2021 · Updated 4 years ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" · ☆661 · Nov 10, 2025 · Updated 3 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] · ☆111 · Dec 4, 2025 · Updated 2 months ago
- MR. Video: MapReduce is the Principle for Long Video Understanding · ☆29 · Apr 23, 2025 · Updated 9 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking" · ☆53 · Feb 2, 2026 · Updated last week
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation · ☆50 · Jan 5, 2026 · Updated last month
- ☆65 · Aug 9, 2024 · Updated last year
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization · ☆262 · Apr 7, 2025 · Updated 10 months ago
- ☆27 · Oct 5, 2023 · Updated 2 years ago
- A curated list of recent efficient video generation methods. · ☆54 · Oct 7, 2025 · Updated 4 months ago
- [ECCV 2024] Official PyTorch implementation of the paper "ParCo: Part-Coordinating Text-to-Motion Synthesis": http://arxiv.org/abs/2403.18512 · ☆71 · Sep 30, 2025 · Updated 4 months ago