naver / bergen
Benchmarking library for RAG
☆196Updated this week
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below
Sorting:
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆172Updated 5 months ago
- Comprehensive benchmark for RAG☆180Updated 6 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 5 months ago
- ☆147Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆157Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆132Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆199Updated last week
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆114Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆135Updated 6 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆445Updated last week
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆140Updated 5 months ago
- Inquisitive Parrots for Search☆191Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆240Updated last year
- Document Ranking with Large Language Models.☆155Updated 3 weeks ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 9 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- Scalable training for dense retrieval models.☆292Updated 2 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆206Updated 11 months ago
- ☆174Updated 2 years ago
- ☆43Updated 3 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆80Updated last year
- ☆279Updated last year
- Benchmark baseline for retrieval qa applications☆110Updated last year
- code for training & evaluating Contextual Document Embedding models☆189Updated this week
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆84Updated 5 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆76Updated 6 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆256Updated 10 months ago
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆130Updated this week
- LOFT: A 1 Million+ Token Long-Context Benchmark☆193Updated 3 weeks ago