baekingeol / Probing-RAG
[NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"
☆18 · Updated 6 months ago
Alternatives and similar repositories for Probing-RAG
Users who are interested in Probing-RAG are comparing it to the libraries listed below.
- Benchmarking library for RAG ☆255 · Updated last week
- Paper reproduction of Google SCoRE (Training Language Models to Self-Correct via Reinforcement Learning) ☆142 · Updated last year
- The training code of Jasper-Token-Compression-600M ☆19 · Updated 2 months ago
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆217 · Updated 7 months ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators" ☆25 · Updated last year
- A curated list of awesome papers about utilizing large language models for ranking. ☆31 · Updated last year
- MIRAGE is a light benchmark to evaluate RAG performance. ☆33 · Updated 8 months ago
- [ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues ☆26 · Updated 7 months ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc… ☆27 · Updated 11 months ago
- Unified collection of Korean benchmark evaluation code(?) ☆20 · Updated last year
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use ☆29 · Updated 3 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆189 · Updated 4 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks… ☆42 · Updated last year
- The Universe of Evaluation. All about the evaluation for LLMs. ☆232 · Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆95 · Updated last year
- ☆519 · Updated 6 months ago
- ☆20 · Updated last year
- [NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage ☆16 · Updated 5 months ago
- The most modern LLM evaluation toolkit ☆70 · Updated 3 months ago
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily) ☆104 · Updated last week
- ☆52 · Updated 8 months ago
- ☆42 · Updated last year
- Large language models for document ranking. ☆71 · Updated 3 weeks ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆136 · Updated last year
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral) ☆183 · Updated 2 months ago
- ☆22 · Updated 7 months ago
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆54 · Updated 2 years ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆64 · Updated 8 months ago
- ☆61 · Updated 8 months ago
- Comprehensive benchmark for RAG ☆260 · Updated 7 months ago