Benchmarking library for RAG
☆273Mar 11, 2026Updated 3 months ago
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Document Ranking with Large Language Models.☆210Feb 14, 2026Updated 4 months ago
- ☆19May 16, 2024Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 8 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆36Nov 21, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆995May 3, 2024Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆604Jun 7, 2026Updated last week
- Official repository of the Seismic library.☆129Apr 8, 2026Updated 2 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,086Jun 8, 2026Updated last week
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆221Apr 14, 2026Updated 2 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆34Dec 2, 2025Updated 6 months ago
- Inquisitive Parrots for Search☆200Jun 5, 2025Updated last year
- Large language models for document ranking.☆75May 20, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated 2 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆210Jul 31, 2024Updated last year
- Provides a common interface to many IR ranking datasets.☆390May 28, 2026Updated 2 weeks ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Mar 29, 2026Updated 2 months ago
- ☆18Aug 21, 2025Updated 9 months ago
- ☆63Jan 26, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆741May 18, 2026Updated 3 weeks ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆45Jan 22, 2026Updated 4 months ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆32Apr 24, 2024Updated 2 years ago
- Retrieval-Augmented Generation battle!☆67Apr 18, 2026Updated last month
- CLIR version of ColBERT☆73May 28, 2026Updated 2 weeks ago
- MIRAGE is a light benchmark to evaluate RAG performance.☆37May 18, 2025Updated last year
- ☆14Jul 7, 2024Updated last year
- ☆45Apr 22, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Multilingual Dialogue Datasets☆19Aug 18, 2022Updated 3 years ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,211Oct 16, 2025Updated 8 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆740Sep 18, 2025Updated 8 months ago
- Curation note of NLP datasets☆99Dec 6, 2022Updated 3 years ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆181Jul 4, 2024Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆82Feb 16, 2022Updated 4 years ago