jinzhuoran / RAG-RewardBench
View external linksLinks

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

☆16

Alternatives and similar repositories for RAG-RewardBench

Users that are interested in RAG-RewardBench are comparing it to the libraries listed below

Sorting:

Zhitao-He / AgentsCourt
View on GitHub
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)
☆15Dec 30, 2024Updated last year
chenlong-clock / RULE-Unlearn
View on GitHub
[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality
☆19Oct 22, 2025Updated 3 months ago
jinzhuoran / MiNer
View on GitHub
A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022
☆11Feb 1, 2023Updated 3 years ago
HongbangYuan / OmniReward
View on GitHub
☆40Dec 16, 2025Updated 2 months ago
hzy312 / knowledge-r1
View on GitHub
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆69May 13, 2025Updated 9 months ago
jinzhuoran / RWKU
View on GitHub
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆90Sep 30, 2024Updated last year
GaryStack / MMR-V
View on GitHub
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆38Jun 23, 2025Updated 7 months ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)
☆27Oct 3, 2025Updated 4 months ago
THU-KEG / PairJudgeRM
View on GitHub
☆14Apr 14, 2025Updated 10 months ago
MDI-Benchmark / MDI-Benchmark
View on GitHub
☆14Dec 18, 2024Updated last year
GaryStack / Trustworthy-Evaluation
View on GitHub
Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)
☆19Jul 19, 2025Updated 6 months ago
GATECH-EIC / LaCache
View on GitHub
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆17Nov 4, 2025Updated 3 months ago
CPF-NLPR / ULGN4DocEFI
View on GitHub
☆10Nov 14, 2021Updated 4 years ago
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 3 months ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆32Jan 23, 2025Updated last year
CogNLP / CogKTR
View on GitHub
CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding. EMNLP 2022
☆31Oct 14, 2022Updated 3 years ago
gautierdag / plancraft
View on GitHub
Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs
☆26Nov 7, 2025Updated 3 months ago
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 4 months ago
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated last year
JingMog / THOR
View on GitHub
Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆31Sep 19, 2025Updated 4 months ago
jinzhuoran / CogKGE
View on GitHub
CogKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge. ACL 2022
☆57Jun 5, 2022Updated 3 years ago
YuyaoZhangQAQ / QCompiler
View on GitHub
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆16Oct 20, 2025Updated 3 months ago
DLYuanGod / EfficientLLM
View on GitHub
☆23May 21, 2025Updated 8 months ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Jul 6, 2024Updated last year
ChengpengLi1003 / Awesome-Long-Chain-of-Thought-Reasoning-with-tools
View on GitHub
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
☆45Dec 17, 2025Updated last month
uq-project / UQ
View on GitHub
UQ: Assessing Language Models on Unsolved Questions
☆30Aug 26, 2025Updated 5 months ago
CogNLP / KENLU-Papers
View on GitHub
An awesome repository for knowledge-enhanced natural language understanding resources, including related papers, codes and datasets.
☆18Sep 21, 2022Updated 3 years ago
linhaowei1 / kumo
View on GitHub
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
☆19Jun 4, 2025Updated 8 months ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆19Mar 4, 2025Updated 11 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
Blackzxy / LoGAH
View on GitHub
☆23Sep 29, 2024Updated last year
X-GenGroup / PaCo-RL
View on GitHub
Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*
☆31Dec 13, 2025Updated 2 months ago
INK-USC / FiD-ICL
View on GitHub
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆15Jul 24, 2023Updated 2 years ago
csbench / csbench
View on GitHub
☆46Oct 28, 2025Updated 3 months ago
IBM / NL2PDDL
View on GitHub
this is for fun, ain't it grand!
☆21Sep 18, 2025Updated 4 months ago
Hao840 / ADEM-VL
View on GitHub
PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"
☆21Oct 28, 2024Updated last year
Tim-Siu / reinforcement-distillation
View on GitHub
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆32Jul 25, 2025Updated 6 months ago
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆21Oct 10, 2024Updated last year
LLMkvsys / rethink-kv-compression
View on GitHub
☆22Mar 7, 2025Updated 11 months ago

jinzhuoran / RAG-RewardBenchView external linksLinks

Alternatives and similar repositories for RAG-RewardBench

jinzhuoran / RAG-RewardBench
View external linksLinks