☆217Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for DenseRetrieval
Users that are interested in DenseRetrieval are comparing it to the libraries listed below
Sorting:
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆730Jan 26, 2026Updated last month
- A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).☆677Jan 7, 2024Updated 2 years ago
- An Open-Source Package for Information Retrieval☆168Updated this week
- This is the repo for the survey of LLM4IR.☆530Nov 13, 2025Updated 3 months ago
- An all-in-one framework for Ad-hoc Information Retrieval.☆18Apr 3, 2024Updated last year
- SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.☆128Feb 15, 2022Updated 4 years ago
- ☆170Oct 20, 2023Updated 2 years ago
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"☆42Dec 9, 2021Updated 4 years ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆142Jan 15, 2024Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- Codebase for RetroMAE and beyond.☆272Jun 7, 2024Updated last year
- Provides a common interface to many IR ranking datasets.☆381Feb 20, 2026Updated 2 weeks ago
- ☆718Oct 7, 2025Updated 5 months ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 3 years ago
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆337Jun 17, 2023Updated 2 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,095Oct 16, 2025Updated 4 months ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆24Aug 21, 2024Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆220Mar 4, 2024Updated 2 years ago
- CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks☆34Aug 31, 2022Updated 3 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,026Updated this week
- ☆15Aug 2, 2021Updated 4 years ago
- Collections of IR Research☆37May 18, 2025Updated 9 months ago
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 8 months ago
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆383Jan 6, 2026Updated 2 months ago
- ☆74Feb 22, 2023Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 9 months ago
- ☆70Jun 16, 2022Updated 3 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆162Jul 3, 2023Updated 2 years ago
- The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shen…☆126Jul 9, 2023Updated 2 years ago
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval".☆64May 7, 2024Updated last year
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- ☆12Oct 28, 2024Updated last year
- The github repository of paper "Understanding Differential Search Index for Text Retrieval" in ACL2023 Findings..☆16May 21, 2023Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- ☆47Mar 27, 2022Updated 3 years ago