Benchmarking library for RAG
☆272Mar 11, 2026Updated 2 months ago
Alternatives and similar repositories for bergen
Users interested in bergen are comparing it to the libraries listed below.
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Document Ranking with Large Language Models.☆210Feb 14, 2026Updated 3 months ago
- ☆19May 16, 2024Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 6 months ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆994May 3, 2024Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆594May 18, 2026Updated last week
- Official repository of the Seismic library.☆125Apr 8, 2026Updated last month
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,076Updated this week
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- KURE: an embedding model specialized for Korean retrieval, developed by Korea University☆217Apr 14, 2026Updated last month
- Makes running benchmarks simple yet maintainable again. Currently supports only Korean cross-encoders.☆33Dec 2, 2025Updated 5 months ago
- Inquisitive Parrots for Search☆200Jun 5, 2025Updated 11 months ago
- Large language models for document ranking.☆75Apr 16, 2026Updated last month
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated 2 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆208Jul 31, 2024Updated last year
- Provides a common interface to many IR ranking datasets.☆390Apr 10, 2026Updated last month
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Mar 29, 2026Updated last month
- ☆18Aug 21, 2025Updated 9 months ago
- ☆63Jan 26, 2025Updated last year
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆738May 18, 2026Updated last week
- Performs benchmarking on two Korean datasets with minimal time and effort.☆45Jan 22, 2026Updated 4 months ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated 2 years ago
- Retrieval-Augmented Generation battle!☆67Apr 18, 2026Updated last month
- CLIR version of ColBERT☆73Jun 23, 2025Updated 11 months ago
- MIRAGE is a light benchmark to evaluate RAG performance.☆37May 18, 2025Updated last year
- ☆14Jul 7, 2024Updated last year
- ☆45Apr 22, 2025Updated last year
- Multilingual Dialogue Datasets☆19Aug 18, 2022Updated 3 years ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,181Oct 16, 2025Updated 7 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Sep 18, 2025Updated 8 months ago
- Curation note of NLP datasets☆99Dec 6, 2022Updated 3 years ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆180Jul 4, 2024Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆81Feb 16, 2022Updated 4 years ago