qinchuanhui / UDA-Benchmark
[NIPS'24] UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
☆31Updated 3 weeks ago
Alternatives and similar repositories for UDA-Benchmark:
Users that are interested in UDA-Benchmark are comparing it to the libraries listed below
- Resources on Large Language Models for Table Processing☆95Updated 4 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆59Updated 5 months ago
- ☆66Updated 9 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆126Updated 8 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆179Updated 5 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.☆176Updated this week
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering"☆23Updated 10 months ago
- This repository contains all the code for the DTS-SQL paper☆47Updated 7 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆194Updated 9 months ago
- ☆154Updated 6 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆114Updated 4 months ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆118Updated 4 months ago
- Repository of LV-Eval Benchmark☆59Updated 6 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆313Updated 5 months ago
- ☆72Updated 3 months ago
- Modular and structured prompt caching for low-latency LLM inference☆89Updated 4 months ago
- The repo for In-context Autoencoder☆112Updated 10 months ago
- A Survey on Data Selection for Language Models☆217Updated 5 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆226Updated last month
- ☆275Updated last year
- This is the repository for the generative information retrieval survey.☆154Updated 3 months ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".☆237Updated 4 months ago
- Implementation of "REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering"☆30Updated 3 months ago
- Contextual Harnessing for Efficient SQL Synthesis☆183Updated 4 months ago
- Collection of training data management explorations for large language models☆313Updated 7 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆42Updated last year
- Code implementation of synthetic continued pretraining☆93Updated 2 months ago
- ☆164Updated last year
- [ICLR2024] Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources☆63Updated 9 months ago