RyanLiu112/Awesome-Process-Reward-Models

RyanLiu112 / Awesome-Process-Reward-Models

A comprehensive collection of process reward models.

☆176

Alternatives and similar repositories for Awesome-Process-Reward-Models

Users who are interested in Awesome-Process-Reward-Models compare it with the repositories listed below.

RyanLiu112 / GenPRM
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102 · Nov 8, 2025 · Updated 8 months ago
mukhal / ThinkPRM
[TMLR] Process Reward Models That Think
☆90 · Nov 29, 2025 · Updated 8 months ago
UW-Madison-Lee-Lab / VersaPRM
☆37 · Feb 11, 2025 · Updated last year
QwenLM / ProcessBench
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆190 · May 20, 2025 · Updated last year
CJReinforce / PURE
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
☆172 · Oct 23, 2025 · Updated 9 months ago
JLZhong23 / awesome-reward-models
☆170 · May 28, 2025 · Updated last year
NJUNLP / R-PRM
☆34 · Apr 1, 2025 · Updated last year
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆2,470 · Nov 9, 2025 · Updated 8 months ago
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆1,437 · May 11, 2026 · Updated 2 months ago
RyanLiu112 / compute-optimal-tts
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
☆288 · Feb 19, 2025 · Updated last year
euiin / SMART
SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…
☆12 · Jul 9, 2025 · Updated last year
sanjibanc / agent_prm
☆60 · Feb 19, 2025 · Updated last year
huggingface / Math-Verify
☆1,172 · Jan 10, 2026 · Updated 6 months ago
hanningzhang / ER-PRM
☆20 · Dec 14, 2024 · Updated last year
RLHFlow / RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
☆1,535 · Apr 24, 2025 · Updated last year
ssmisya / PRMBench
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆94 · Feb 15, 2025 · Updated last year
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆400 · Jun 22, 2026 · Updated last month
longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,848 · Jan 17, 2025 · Updated last year
GAIR-NLP / ToRL
☆352 · May 24, 2025 · Updated last year
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year
hemingkx / Awesome-Efficient-Reasoning
Paper list for Efficient Reasoning.
☆899 · May 29, 2026 · Updated 2 months ago
RM-R1-UIUC / RM-R1
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167 · Jun 26, 2025 · Updated last year
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,096 · Jun 2, 2025 · Updated last year
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆357 · Jan 22, 2026 · Updated 6 months ago
liziniu / policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29 · Dec 19, 2023 · Updated 2 years ago
idanshen / Value-Augmented-Sampling
☆20 · May 16, 2024 · Updated 2 years ago
MARIO-Math-Reasoning / Super_MARIO
☆341 · Jun 5, 2025 · Updated last year
HKAIR-Lab / HK-O1aw
☆43 · Nov 1, 2024 · Updated last year
RyanLiu112 / MRN
[NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…
☆26 · Feb 15, 2025 · Updated last year
NuoJohnChen / JudgeLRM
JudgeLRM: Large Reasoning Models as a Judge
☆42 · May 6, 2026 · Updated 2 months ago
0russwest0 / Awesome-Agent-RL
☆511 · Oct 11, 2025 · Updated 9 months ago
thunlp / OPD
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
☆855 · Jun 29, 2026 · Updated last month
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,497 · Mar 9, 2026 · Updated 4 months ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,158 · Jun 9, 2026 · Updated last month
Chen-GX / SEER
☆15 · Feb 10, 2025 · Updated last year
THU-KEG / RM-Bench
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
☆84 · Jul 18, 2025 · Updated last year
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,026 · Jul 15, 2026 · Updated 2 weeks ago