Benchmarking library for RAG
☆263Mar 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for bergen
Users that are interested in bergen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- ☆19May 16, 2024Updated last year
- Document Ranking with Large Language Models.☆205Feb 14, 2026Updated last month
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 4 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆984May 3, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆583Mar 12, 2026Updated last week
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆106Updated this week
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,040Updated this week
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆209Feb 26, 2026Updated 3 weeks ago
- Inquisitive Parrots for Search☆200Jun 5, 2025Updated 9 months ago
- Large language models for document ranking.☆71Jan 13, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated last year
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆202Jul 31, 2024Updated last year
- Provides a common interface to many IR ranking datasets.☆386Feb 20, 2026Updated last month
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Oct 14, 2025Updated 5 months ago
- ☆18Aug 21, 2025Updated 7 months ago
- ☆60Jan 26, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Retrieval-Augmented Generation battle!☆64Updated this week
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆734Jan 26, 2026Updated 2 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated last year
- CLIR version of ColBERT☆73Jun 23, 2025Updated 9 months ago
- ☆14Jul 7, 2024Updated last year
- ☆43Apr 22, 2025Updated 11 months ago
- Model implementation for the contextual embeddings project☆43Jun 2, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multilingual Dialogue Datasets☆19Aug 18, 2022Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,120Oct 16, 2025Updated 5 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Sep 18, 2025Updated 6 months ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Mar 6, 2024Updated 2 years ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆173Jul 4, 2024Updated last year