Benchmarking library for RAG
☆266Mar 11, 2026Updated last month
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- ☆19May 16, 2024Updated last year
- Document Ranking with Large Language Models.☆206Feb 14, 2026Updated 2 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 5 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆990May 3, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆586Apr 4, 2026Updated last week
- Official repository of the Seismic library.☆118Apr 8, 2026Updated last week
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆31Dec 2, 2025Updated 4 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,046Updated this week
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆210Apr 4, 2026Updated last week
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 10 months ago
- Large language models for document ranking.☆72Jan 13, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆204Jul 31, 2024Updated last year
- Provides a common interface to many IR ranking datasets.☆386Feb 20, 2026Updated last month
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Mar 29, 2026Updated 2 weeks ago
- ☆18Aug 21, 2025Updated 7 months ago
- ☆60Jan 26, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Retrieval-Augmented Generation battle!☆64Mar 31, 2026Updated 2 weeks ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆733Jan 26, 2026Updated 2 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated last year
- CLIR version of ColBERT☆73Jun 23, 2025Updated 9 months ago
- ☆14Jul 7, 2024Updated last year
- ☆43Apr 22, 2025Updated 11 months ago
- Multilingual Dialogue Datasets☆19Aug 18, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,144Oct 16, 2025Updated 5 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Sep 18, 2025Updated 6 months ago
- Curation note of NLP datasets☆99Dec 6, 2022Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆174Jul 4, 2024Updated last year
- ☆14Jan 10, 2025Updated last year