facebookresearch/multimodal_rewardbench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/multimodal_rewardbench)

facebookresearch / multimodal_rewardbench

Multimodal RewardBench

☆68

Alternatives and similar repositories for multimodal_rewardbench

Users that are interested in multimodal_rewardbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vl-rewardbench / VL_RewardBench
View on GitHub
☆29Jul 23, 2025Updated last year
RifleZhang / LLaVA-Reasoner-DPO
View on GitHub
☆116Jan 8, 2025Updated last year
si0wang / VisVM
View on GitHub
☆46Dec 30, 2024Updated last year
viswavi / RLCF
View on GitHub
☆24Oct 23, 2025Updated 9 months ago
SkyworkAI / Skywork-Reward-V2
View on GitHub
Scaling Preference Data Curation via Human-AI Synergy
☆152Jul 3, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Coobiw / IE-Critic-R1
View on GitHub
IE-Critic-R1: Advancing the Explanatory Measurement of Text-Driven Image Editing for Human Perception Alignment
☆19Nov 26, 2025Updated 8 months ago
ugonfor / DGQ
View on GitHub
[ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models
☆19Mar 25, 2025Updated last year
yfzhang114 / r1_reward
View on GitHub
✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
☆291May 9, 2025Updated last year
MAmmoTH-VL / MAmmoTH-VL
View on GitHub
(ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆50Jun 4, 2025Updated last year
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated last year
yuhui-zh15 / AutoConverter
View on GitHub
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…
☆40May 26, 2025Updated last year
Kwai-YuanQi / MM-RLHF
View on GitHub
The Next Step Forward in Multimodal LLM Alignment
☆199May 1, 2025Updated last year
THU-KEG / Crab
View on GitHub
[CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models
☆18May 23, 2025Updated last year
RUCAIBox / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆110May 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kaist-cvml / scribble-guided-diffusion
View on GitHub
[ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation
☆26Oct 2, 2024Updated last year
Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 9 months ago
ls-kelvin / REVPT
View on GitHub
Code for paper: Reinforced Vision Perception with Tools
☆74Oct 3, 2025Updated 9 months ago
marinero4972 / CyberV
View on GitHub
☆20Jun 10, 2025Updated last year
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
Liuziyu77 / MIA-DPO
View on GitHub
Official implement of MIA-DPO
☆69Jan 23, 2025Updated last year
zlab-princeton / vero
View on GitHub
Vero: An Open RL Recipe for General Visual Reasoning
☆137Jun 19, 2026Updated last month
lcysyzxdxc / AGIQA-1k-Database
View on GitHub
AGIQA-1k-Database for AI Generated Content Image Quality Assessment
☆30May 1, 2023Updated 3 years ago
holarissun / RewardModelingBeyondBradleyTerry
View on GitHub
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆73Apr 2, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
uclanlp / OpenVLThinker
View on GitHub
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆155May 25, 2026Updated 2 months ago
weizhou-geek / SFSN
View on GitHub
Implementation of QoMEX 2021 "Image Super-Resolution Quality Assessment: Structural Fidelity Versus Statistical Naturalness"
☆16Sep 28, 2022Updated 3 years ago
InternLM / ARC-VL
View on GitHub
[CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"
☆46Nov 26, 2025Updated 8 months ago
si0wang / ThinkLite-VL
View on GitHub
☆105Jun 10, 2025Updated last year
waterhorse1 / Natural-language-RL
View on GitHub
Natural Language Reinforcement Learning
☆101Jul 30, 2025Updated 11 months ago
InternLM / Spatial-SSRL
View on GitHub
[CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
☆133Apr 7, 2026Updated 3 months ago
GAIR-NLP / scaleeval
View on GitHub
Scalable Meta-Evaluation of LLMs as Evaluators
☆43Feb 15, 2024Updated 2 years ago
qq456cvb / 3DCorrEnhance
View on GitHub
☆37Jun 13, 2026Updated last month
FanqingM / MM-Eureka-V0
View on GitHub
MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka
☆325Jun 21, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
NuoJohnChen / JudgeLRM
View on GitHub
JudgeLRM: Large Reasoning Models as a Judge
☆42May 6, 2026Updated 2 months ago
wwxu21 / CUT
View on GitHub
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Feb 29, 2024Updated 2 years ago
Cooperx521 / ScaleCap
View on GitHub
(ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’
☆60Jan 26, 2026Updated 6 months ago
agents-x-project / PyVision
View on GitHub
[MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."
☆162Jul 22, 2025Updated last year
GAIR-NLP / MetaCritique
View on GitHub
Evaluate the Quality of Critique
☆37Jun 1, 2024Updated 2 years ago
ruixin31 / Spurious_Rewards
View on GitHub
☆361Jul 29, 2025Updated 11 months ago
MJ-Bench / MJ-Bench
View on GitHub
(NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
☆51Jun 3, 2025Updated last year