qinchuanhui / UDA-BenchmarkLinks
[NIPS'24] UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
☆42Updated 10 months ago
Alternatives and similar repositories for UDA-Benchmark
Users that are interested in UDA-Benchmark are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆49Updated 10 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆248Updated last year
- A High-Efficiency System of Large Language Model Based Search Agents☆74Updated 6 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆236Updated 3 months ago
- ☆93Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆95Updated 7 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆73Updated 7 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆215Updated 7 months ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆163Updated last year
- Comprehensive benchmark for RAG☆251Updated 6 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆84Updated last month
- PGRAG☆51Updated last year
- ☆105Updated last year
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆407Updated 8 months ago
- ☆228Updated last month
- Code for Parametric RAG, SIGIR 2025 Full Paper☆215Updated 8 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆144Updated last week
- ☆35Updated 10 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆141Updated 10 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆143Updated 7 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆167Updated last year
- [NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆92Updated last year
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆198Updated 2 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆193Updated last year
- Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs (ACL 2024)☆292Updated last year
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models☆75Updated 2 years ago
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers☆130Updated 8 months ago
- ☆310Updated 5 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆92Updated last year
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.☆310Updated 2 months ago