sjz5202/LLaVA-Reward

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sjz5202/LLaVA-Reward)

sjz5202 / LLaVA-Reward

Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation

☆26

Alternatives and similar repositories for LLaVA-Reward

Users that are interested in LLaVA-Reward are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SAIS-FUXI / EvalAlign
View on GitHub
☆19Oct 23, 2024Updated last year
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
sen-mao / FasterDiffusion-DiT
View on GitHub
Official Implementations "Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference" for DiT (NeurIPS'24)
☆15Aug 3, 2025Updated 11 months ago
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago
sen-mao / SuppressEOT
View on GitHub
Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)
☆60Dec 3, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hecoding / Hyper-Modulation
View on GitHub
Official Implementation for "Transferring Unconditional to Conditional GANs with Hyper-Modulation" CVPRW 22 https://arxiv.org/abs/2112.02…
☆13Jun 28, 2022Updated 4 years ago
wangkai930418 / attndistill
View on GitHub
code for our paper "Attention Distillation: self-supervised vision transformer students need more guidance" in BMVC 2022
☆17Oct 4, 2022Updated 3 years ago
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
Ji4chenLi / rg-lcd
View on GitHub
Reward Guided Latent Consistency Distillation
☆26Oct 9, 2024Updated last year
G-U-N / Diffusion-NPO
View on GitHub
[ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…
☆39Jan 26, 2026Updated 6 months ago
Franklin-Zhang0 / ReasonGen-R1
View on GitHub
Official respository for ReasonGen-R1
☆75Jun 23, 2025Updated last year
microsoft / SuperRL
View on GitHub
☆15Sep 8, 2025Updated 10 months ago
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
aniki-ly / FlowZero
View on GitHub
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax
☆18Nov 23, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
renjie3 / MemAttn
View on GitHub
☆16Feb 23, 2025Updated last year
babahui / Progressive-Text-to-Image
View on GitHub
☆15Sep 18, 2023Updated 2 years ago
Albert0147 / BAIT_SFUDA
View on GitHub
Unsupervised Domain Adaptation without Source Data by Casting a BAIT
☆23Sep 18, 2022Updated 3 years ago
feizc / Video-In-Context
View on GitHub
Video Diffusion Transformers are In-Context Learners
☆37Jan 6, 2025Updated last year
sen-mao / Loopfree
View on GitHub
[CVPR2025] Official Implementations "One-Way Ticket : Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models"
☆29Mar 16, 2026Updated 4 months ago
liujianzhi / EchoReel
View on GitHub
An innovative method designed to augment the capabilities of existing video diffusion models
☆22May 10, 2024Updated 2 years ago
daeunni / Video-Skill-CoT
View on GitHub
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18Aug 27, 2025Updated 11 months ago
Fredreic1849 / BranchGRPO
View on GitHub
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
☆47Oct 30, 2025Updated 8 months ago
QC-LY / UiG
View on GitHub
Code for "Understanding-in-Generation:Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15Nov 11, 2025Updated 8 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
j-min / VPGen
View on GitHub
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆57Jul 25, 2023Updated 3 years ago
takomc / amp
View on GitHub
【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"
☆22Sep 26, 2024Updated last year
yhZhai / mcm
View on GitHub
[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
☆71Oct 27, 2024Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
CodeGoat24 / UnifiedReward
View on GitHub
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
☆796Jun 18, 2026Updated last month
mm-vl / ULM-R1
View on GitHub
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆48Jul 22, 2025Updated last year
Albert0147 / OneRing_SF-OPDA
View on GitHub
Code for 'OneRing: A Simple Method for Source-free Open-partial Domain Adaptation'
☆34Feb 16, 2023Updated 3 years ago
zhentao-zou / MURE
View on GitHub
Beyond Textual CoT: Interleaved Text-image chains with Deep Confidence Reasoning for Image Editing
☆19Jun 24, 2026Updated last month
ryylcc / OWSOL
View on GitHub
☆15Feb 18, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kaist-cvml / scribble-guided-diffusion
View on GitHub
[ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation
☆26Oct 2, 2024Updated last year
zeyofu / Commonsense-T2I
View on GitHub
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24Aug 13, 2024Updated last year
ByteDance-Seed / BM-code
View on GitHub
[Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions
☆45Jun 11, 2025Updated last year
ypwang61 / StoryEval
View on GitHub
[CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
☆21May 2, 2025Updated last year
ip-composer / IP-Composer
View on GitHub
☆20Apr 15, 2025Updated last year
g-luo / dual_process
View on GitHub
Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025
☆133Aug 29, 2025Updated 10 months ago
iLearn-Lab / ECCV2024-genview
View on GitHub
[ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".
☆29Dec 18, 2024Updated last year