naver / bergen
Benchmarking library for RAG
☆193Updated last week
Alternatives and similar repositories for bergen:
Users that are interested in bergen are comparing it to the libraries listed below
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆170Updated 4 months ago
- Comprehensive benchmark for RAG☆167Updated 5 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆155Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆99Updated last week
- Finetune mistral-7b-instruct for sentence embeddings☆81Updated 11 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆133Updated 5 months ago
- ☆145Updated last year
- Inquisitive Parrots for Search☆190Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆53Updated 5 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆436Updated this week
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆128Updated 10 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆480Updated 6 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆126Updated last week
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆106Updated 6 months ago
- ☆174Updated 2 years ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 4 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆203Updated 10 months ago
- Retrieval-Augmented Generation battle!☆49Updated 4 months ago
- Document Ranking with Large Language Models.☆147Updated this week
- Scalable training for dense retrieval models.☆292Updated 2 months ago
- awesome synthetic (text) datasets☆272Updated 5 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆137Updated 4 months ago
- CLIR version of ColBERT☆68Updated last month
- LOFT: A 1 Million+ Token Long-Context Benchmark☆190Updated this week
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆323Updated 4 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated 2 weeks ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆153Updated last year
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆300Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆192Updated last year