myscale / Retrieval-QA-Benchmark
Benchmark baseline for retrieval QA applications
☆108 Updated last year
Alternatives and similar repositories for Retrieval-QA-Benchmark:
Users who are interested in Retrieval-QA-Benchmark are comparing it to the repositories listed below:
- ☆218 Updated 8 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆155 Updated last year
- ☆278 Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆257 Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆169 Updated 4 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents". ☆104 Updated 5 months ago
- Comprehensive benchmark for RAG ☆164 Updated 5 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆134 Updated 3 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆68 Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆77 Updated 6 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆96 Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b… ☆43 Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆150 Updated last year
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23 ☆201 Updated 10 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generation ☆192 Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆55 Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆237 Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆181 Updated 6 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆81 Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆133 Updated 5 months ago
- ☆175 Updated 2 years ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆150 Updated last year
- Evaluation tools for Retrieval-augmented Generation (RAG) methods. ☆149 Updated 4 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆137 Updated 5 months ago
- YuLan-IR: Information Retrieval Boosted LMs ☆218 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆205 Updated 5 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation. ☆125 Updated 9 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation". ☆230 Updated 10 months ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆135 Updated 9 months ago
- Finetune mistral-7b-instruct for sentence embeddings ☆81 Updated 11 months ago