naver / bergenLinks
Benchmarking library for RAG
☆235Updated 3 weeks ago
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below
Sorting:
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆205Updated 10 months ago
- Comprehensive benchmark for RAG☆231Updated 4 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆205Updated 4 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆163Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 10 months ago
- ☆155Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆169Updated last month
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆159Updated 2 weeks ago
- Finetune mistral-7b-instruct for sentence embeddings☆86Updated last year
- Document Ranking with Large Language Models.☆191Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆216Updated last week
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆306Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆198Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆48Updated last year
- Scalable training for dense retrieval models.☆297Updated 4 months ago
- Test-time compute in information retrieval☆46Updated 3 months ago
- Retrieval-Augmented Generation battle!☆59Updated 3 months ago
- ☆150Updated 2 weeks ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆345Updated 10 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆545Updated this week
- ☆189Updated 3 months ago
- ☆292Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark☆218Updated 4 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆144Updated 11 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆93Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆108Updated 3 weeks ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆248Updated 2 years ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆218Updated last year