chuzhumin98 / PRE
A general framework for evaluating the performance of large language models (LLMs), based on a peer-review mechanism among LLMs
☆18Updated last year
Alternatives and similar repositories for PRE
Users who are interested in PRE are comparing it to the repositories listed below
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆59Updated 4 months ago
- Code to reproduce THUIR’s submissions for COLIEE 2023 Task1 and Task2☆26Updated 2 years ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆152Updated last year
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆71Updated 4 months ago
- Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation☆20Updated 6 months ago
- YuLan-IR: Information Retrieval Boosted LMs☆221Updated last year
- The demo, code and data of FollowRAG☆74Updated 2 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 10 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆71Updated 9 months ago
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain☆30Updated 8 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆140Updated 4 months ago
- A curated list of resources dedicated to retrieval-augmented generation (RAG).☆120Updated this week
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆184Updated 11 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆137Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆108Updated 10 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆235Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆62Updated 8 months ago
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆92Updated 2 years ago
- ☆99Updated 11 months ago
- Generative Judge for Evaluating Alignment☆245Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆57Updated last year
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"☆43Updated 11 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆123Updated 7 months ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Updated 2 years ago
- A framework for editing the CoTs for better factuality☆51Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆167Updated last week
- Code implementation of synthetic continued pretraining☆129Updated 8 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆93Updated 7 months ago
- ☆30Updated 3 weeks ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆73Updated last year