chuzhumin98 / PRE
A general framework for evaluating the performance of large language models (LLMs), based on a peer-review mechanism among LLMs
☆19 Updated last year
Alternatives and similar repositories for PRE
Users that are interested in PRE are comparing it to the libraries listed below
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆153 Updated last year
- YuLan-IR: Information Retrieval Boosted LMs ☆221 Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆59 Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆139 Updated last year
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering ☆58 Updated 7 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆138 Updated 11 months ago
- An Open-Source Package for Information Retrieval ☆166 Updated this week
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024) ☆71 Updated 5 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23 ☆236 Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆58 Updated last year
- Code to reproduce THUIR's submissions for COLIEE 2023 Task1 and Task2 ☆26 Updated 2 years ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆95 Updated 7 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation. ☆140 Updated 5 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆73 Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆69 Updated 5 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain" ☆72 Updated 9 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆82 Updated 2 years ago
- The demo, code and data of FollowRAG ☆75 Updated 3 months ago
- A curated list of awesome papers about information retrieval(IR) in the age of large language model(LLM). These include retrieval augment… ☆77 Updated last year
- Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation ☆21 Updated 7 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener… ☆60 Updated last year
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models ☆16 Updated 4 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆184 Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆167 Updated last month
- ☆291 Updated last year
- Code implementation of synthetic continued pretraining ☆135 Updated 9 months ago
- https://acl2023-retrieval-lm.github.io/ ☆156 Updated 2 years ago
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>> ☆339 Updated last year
- Code for the paper: Metacognitive Retrieval-Augmented Large Language Models