naver / bergenLinks
Benchmarking library for RAG
☆222Updated last month
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below
Sorting:
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆196Updated 8 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆193Updated 2 months ago
- Comprehensive benchmark for RAG☆211Updated 2 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆160Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆153Updated last month
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆525Updated this week
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 8 months ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆305Updated last year
- Document Ranking with Large Language Models.☆183Updated 3 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆43Updated last year
- ☆154Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆132Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆86Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆194Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆161Updated 3 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆247Updated last year
- Retrieval-Augmented Generation battle!☆58Updated last month
- Scalable training for dense retrieval models.☆299Updated 2 months ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆340Updated 8 months ago
- ☆50Updated 7 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆219Updated last year
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆140Updated 3 months ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆227Updated last year
- ☆286Updated last year
- Complex Function Calling Benchmark.☆124Updated 7 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆213Updated 3 weeks ago
- Inquisitive Parrots for Search☆195Updated 2 months ago
- ☆132Updated 5 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 10 months ago