[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆389 · Mar 26, 2025 · Updated 11 months ago
Alternatives and similar repositories for VisionReward
Users who are interested in VisionReward are comparing it to the libraries listed below.
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation ☆1,650 · Oct 29, 2025 · Updated 4 months ago
- Official Implementation of VideoDPO ☆161 · Jun 1, 2025 · Updated 9 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback ☆437 · Sep 24, 2025 · Updated 5 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆670 · Nov 10, 2025 · Updated 4 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex ☆740 · Mar 7, 2026 · Updated last week
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆649 · May 24, 2024 · Updated last year
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL ☆2,073 · Nov 4, 2025 · Updated 4 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025) ☆279 · Dec 5, 2025 · Updated 3 months ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆265 · Apr 7, 2025 · Updated 11 months ago
- ☆582 · Dec 21, 2024 · Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆312 · Mar 12, 2025 · Updated last year
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model" ☆244 · Apr 6, 2024 · Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆113 · Dec 4, 2025 · Updated 3 months ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process ☆718 · Feb 10, 2026 · Updated last month
- An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation ☆1,551 · Oct 16, 2025 · Updated 5 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆154 · Jun 25, 2024 · Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆40 · May 9, 2024 · Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆434 · Mar 3, 2025 · Updated last year
- [ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models. ☆408 · Feb 6, 2026 · Updated last month
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation ☆1,532 · Mar 13, 2026 · Updated last week
- ☆200 · Jul 12, 2024 · Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆86 · May 4, 2025 · Updated 10 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation ☆859 · May 23, 2025 · Updated 9 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati… ☆35 · Jan 26, 2026 · Updated last month
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,941 · Aug 15, 2024 · Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆314 · Nov 1, 2024 · Updated last year
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆475 · Dec 6, 2025 · Updated 3 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆635 · Oct 29, 2025 · Updated 4 months ago
- Q-Insight is open-sourced at https://github.com/bytedance/Q-Insight. This repository will not receive further updates. ☆142 · May 30, 2025 · Updated 9 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis ☆1,262 · Mar 5, 2025 · Updated last year
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”. ☆19 · May 22, 2025 · Updated 9 months ago
- A unified inference and post-training framework for accelerated video generation. ☆3,154 · Updated this week
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT ☆432 · Sep 18, 2025 · Updated 6 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,895 · Jan 8, 2026 · Updated 2 months ago
- Scalable and memory-optimized training of diffusion models ☆1,344 · Jun 4, 2025 · Updated 9 months ago
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning" ☆19 · Feb 25, 2026 · Updated 3 weeks ago
- [CVPR 2025 Oral] Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,553 · Nov 10, 2025 · Updated 4 months ago
- ☆572 · Nov 26, 2024 · Updated last year
- ☆54 · May 6, 2025 · Updated 10 months ago