qinchuanhui / UDA-BenchmarkLinks
[NIPS'24] UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
☆41Updated 8 months ago
Alternatives and similar repositories for UDA-Benchmark
Users that are interested in UDA-Benchmark are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆46Updated 9 months ago
- [NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆90Updated last year
- Comprehensive benchmark for RAG☆232Updated 4 months ago
- ☆84Updated 11 months ago
- A High-Efficiency System of Large Language Model Based Search Agents☆75Updated 4 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆230Updated last month
- A Comprehensive Library for Memory of LLM-based Agents.☆85Updated 5 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆157Updated last year
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆71Updated 6 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆111Updated 2 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆132Updated 8 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated last year
- ☆104Updated 11 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆201Updated 5 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆247Updated last year
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆177Updated last year
- Repository of LV-Eval Benchmark☆70Updated last year
- ☆239Updated last year
- ☆151Updated 3 weeks ago
- This is the code of MMOA-RAG.☆83Updated 5 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆187Updated last year
- ☆37Updated 9 months ago
- e☆41Updated 6 months ago
- Code implementation of synthetic continued pretraining☆137Updated 10 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆131Updated 7 months ago
- PerLTQA is a new benchmark for memory classification, retrieval, and synthesis of Large Language Models☆26Updated 3 months ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆109Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆187Updated 2 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper☆204Updated 6 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆128Updated 9 months ago