THU-KEG/PairJudgeRM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THU-KEG/PairJudgeRM)

THU-KEG / PairJudgeRM

☆15

Alternatives and similar repositories for PairJudgeRM

Users that are interested in PairJudgeRM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jinzhuoran / RAG-RewardBench
View on GitHub
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18Dec 19, 2024Updated last year
aster2024 / SWIFT
View on GitHub
Source code for SWIFT, an efficient reward model.
☆21Jan 13, 2026Updated 6 months ago
TREC-RAG / trec-rag.github.io
View on GitHub
Website for TREC RAG
☆14Jul 19, 2026Updated last week
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SkyworkAI / skywork-o1-prm-inference
View on GitHub
☆69Nov 26, 2024Updated last year
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
THU-KEG / RM-Bench
View on GitHub
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
☆84Jul 18, 2025Updated last year
kyunghyuncho / jax-practice
View on GitHub
☆13Aug 17, 2020Updated 5 years ago
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 5 months ago
lgresearch / QASA
View on GitHub
☆33Oct 30, 2023Updated 2 years ago
rempsyc / starter-academic
View on GitHub
My personal site, using Wowchemy
☆13Updated this week
euclid-multimodal / Euclid
View on GitHub
☆18Jan 9, 2025Updated last year
xiusic / MinPrompt
View on GitHub
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
☆14May 3, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
johndpope / ltx2-castlehill
View on GitHub
CastleHill: Separable Causal Diffusion / Varitaion Flow Maps for LTX-2 long-form video generation
☆15May 19, 2026Updated 2 months ago
wenlinyao / HDFlow
View on GitHub
Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows
☆15Oct 4, 2024Updated last year
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
leloykun / steepest-descent-lean
View on GitHub
Deriving steepest descent convergence bounds and hyperparameter scaling laws in machine learning optimization from first principles, form…
☆16Apr 11, 2026Updated 3 months ago
Kurt232 / RLKV
View on GitHub
☆36Jun 8, 2026Updated last month
divelab / Sys2Bench
View on GitHub
Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l…
☆31Mar 5, 2025Updated last year
lujiaxuan0520 / Test-Time-Tool-Evol
View on GitHub
Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.
☆43Jan 13, 2026Updated 6 months ago
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
zeyofu / ReFocus_Code
View on GitHub
Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]
☆50Jul 22, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
pierrefdz / stable_signature
View on GitHub
Please go to https://github.com/facebookresearch/stable_signature
☆14Jul 26, 2023Updated 3 years ago
Yu-chen-Deng / LAPIG
View on GitHub
[TVCG & VR'25] LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization
☆11Apr 16, 2026Updated 3 months ago
bubble65 / DLLM-Searcher
View on GitHub
DLLM-Searcher has been accepted by SIGIR 2026! 🥳
☆33Jan 23, 2026Updated 6 months ago
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
alexmartin1722 / wikivideo
View on GitHub
WikiVideo: Article Generation from Multiple Videos
☆15Nov 14, 2025Updated 8 months ago
URRealHero / JudgeAnything
View on GitHub
☆17Jun 1, 2025Updated last year
MasterVito / SvS
View on GitHub
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54Dec 13, 2025Updated 7 months ago
PKU-AICare / ConfAgents
View on GitHub
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis
☆15Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Alrightlone / SparAlloc
View on GitHub
SparAlloc: A Simple and Modular Framework for Decoupled Sparsity Allocation in Layerwise Pruning for LLM
☆16Jun 5, 2025Updated last year
yansheng-qiu / AI_Idea_Bench_2025
View on GitHub
☆16May 15, 2025Updated last year
mlwu22 / RED
View on GitHub
Implementation code for ACL2024：Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15Apr 20, 2024Updated 2 years ago
jiaosiyuu / ThinkGen
View on GitHub
ThinkGen: Generalized Thinking for Visual Generation
☆61Dec 30, 2025Updated 6 months ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
smallporridge / TrustworthyRAG
View on GitHub
☆16May 18, 2026Updated 2 months ago
MohamedFawzy / recommendation-engine
View on GitHub
Recommendation engine and it's algorithms in python , R .
☆12Oct 26, 2018Updated 7 years ago