THU-KEG / R-EvalLinks
[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
☆12Updated last year
Alternatives and similar repositories for R-Eval
Users that are interested in R-Eval are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 months ago
- ☆22Updated 2 months ago
- [EMNLP 2024] Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models☆37Updated 10 months ago
- ☆22Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Updated 5 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆21Updated 11 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆19Updated 4 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆36Updated 6 months ago
- ☆29Updated last year
- ☆74Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆25Updated 10 months ago
- ☆35Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆51Updated 2 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆124Updated 8 months ago
- Evaluate the Quality of Critique☆36Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆54Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated 4 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆82Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated 11 months ago
- ☆46Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- Code for Benchmarking Language Model Agents for Data-Driven Science☆32Updated 11 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Updated 2 years ago
- Code for the paper: Metacognitive Retrieval-Augmented Large Language Models☆34Updated last year
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode…☆52Updated last year
- ☆33Updated 11 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆28Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆15Updated 6 months ago