naver / bergen
Benchmarking library for RAG
☆213Updated last month
Alternatives and similar repositories for bergen
Users who are interested in bergen compare it to the libraries listed below.
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".☆176Updated 3 weeks ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆191Updated 7 months ago
- Comprehensive benchmark for RAG☆199Updated last month
- Document Ranking with Large Language Models.☆169Updated last month
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 7 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆160Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆150Updated 2 months ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆300Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆494Updated last week
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆90Updated 8 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generation☆192Updated last year
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆148Updated 7 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆85Updated last year
- The Universe of Evaluation. All about the evaluation for LLMs.☆224Updated last year
- Complex Function Calling Benchmark.☆118Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- ☆151Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated 11 months ago
- ☆123Updated 4 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆244Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆206Updated last month
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆217Updated last year
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆134Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆66Updated last year
- awesome synthetic (text) datasets☆289Updated last week
- Scalable training for dense retrieval models.☆299Updated last month
- Inquisitive Parrots for Search☆193Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆139Updated 8 months ago
- Retrieval-Augmented Generation battle!☆52Updated 7 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆90Updated 7 months ago