NuoJohnChen/JudgeLRM

NuoJohnChen / JudgeLRM

JudgeLRM: Large Reasoning Models as a Judge

☆42

Alternatives and similar repositories for JudgeLRM

Users who are interested in JudgeLRM are comparing it to the libraries listed below.

pratyushmaini / ssft
[NeurIPS'22] Official Repository for Characterizing Datapoints via Second-Split Forgetting
☆16 · Aug 11, 2023 · Updated 2 years ago
KodCode-AI / code-r1
Reproducing R1 for Code with Reliable Rewards
☆13 · Apr 9, 2025 · Updated last year
Xtra-Computing / LLM-DNA
[ICLR'26 Oral] LLM DNA: Tracing Model Evolution via Functional Representations
☆32 · Apr 8, 2026 · Updated 3 months ago
RM-R1-UIUC / RM-R1
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167 · Jun 26, 2025 · Updated last year
NuoJohnChen / XtraGPT
[ACL 2026 Main] XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration
☆25 · Apr 23, 2026 · Updated 2 months ago
ritzz-ai / PACS
☆31 · Sep 12, 2025 · Updated 10 months ago
princeton-pli / what-makes-good-rm
[NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective
☆44 · Sep 18, 2025 · Updated 10 months ago
sen-ye / R3
[ICLR26] Understanding VS. Generation: Navigating Optimization Dilemma in Multimodal Models
☆25 · May 6, 2026 · Updated 2 months ago
FanZT6 / FairMT-bench
☆14 · Mar 7, 2025 · Updated last year
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆41 · Sep 30, 2024 · Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
☆47 · Apr 20, 2025 · Updated last year
tangzhy / RealCritic
☆15 · Jan 27, 2025 · Updated last year
RyanLiu112 / GenPRM
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102 · Nov 8, 2025 · Updated 8 months ago
liziniu / cold_start_rl
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20 · Mar 9, 2025 · Updated last year
CrawlScript / MMClaw
Ultra-Lightweight, Pure Python Multimodal Agent.
☆145 · Jul 11, 2026 · Updated last week
ablghtianyi / ICL_Modular_Arithmetic
☆19 · Mar 25, 2025 · Updated last year
fansunqi / VideoTool
Official Repository for NeurIPS'25 Paper "Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task"
☆23 · May 18, 2026 · Updated 2 months ago
yl4467 / singer
☆15 · Feb 22, 2025 · Updated last year
bethgelab / delta-belief-rl
Official implementation of the ΔBelief-RL method.
☆31 · Feb 28, 2026 · Updated 4 months ago
facebookresearch / rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
☆50 · Jan 19, 2024 · Updated 2 years ago
SeanLeng1 / Reward-Calibration
☆21 · Dec 14, 2024 · Updated last year
lamps-lab / Patent-figure-segmentor
☆14 · Aug 12, 2022 · Updated 3 years ago
gaocegege / xuruowei-forever
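https://xuruowei.com was created by her family, friends, and her beloved partner Gao Ce in memory of her. Xu Ruowei passed away on February 28, 2026. We hope to commemorate her life through this timeline: photos, stories, writings, music, and everything she loved. Walk along the path of her life and touch those warm moments once again.
☆28 · Apr 1, 2026 · Updated 3 months ago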
RUCKBReasoning / CodeRM
Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'
☆27 · May 16, 2025 · Updated last year
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
thu-coai / BARREL
[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
☆18 · May 21, 2025 · Updated last year
tianyi-lab / MiP-Overthinking
[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
☆39 · Jun 5, 2025 · Updated last year
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆128 · May 7, 2024 · Updated 2 years ago
Reza-esfandiarpoor / the-mcp-company
TheMCPCompany: Creating General-purpose Agents with Task-specific Tools
☆16 · Dec 19, 2025 · Updated 7 months ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆126 · May 6, 2025 · Updated last year
LivingFutureLab / DeltaBench
☆45 · Mar 4, 2025 · Updated last year
Linear95 / APO
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
☆54 · Jun 3, 2024 · Updated 2 years ago
teddysum / korean_evaluation
☆10 · Jun 5, 2025 · Updated last year
PiggyJerry / DC-Net
The code for paper: "DC-Net: Divide-and-Conquer for Salient Object Detection"
☆22 · Aug 30, 2024 · Updated last year
yiqingxyq / RepoST
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24 · Mar 18, 2025 · Updated last year
sungnyun / cav2vec
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16 · Apr 29, 2025 · Updated last year
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18 · May 1, 2025 · Updated last year
googleinterns / localizing-paragraph-memorization
☆15 · Feb 21, 2024 · Updated 2 years ago
Lux0926 / ASPRM
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10 · Mar 2, 2025 · Updated last year