A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks
☆385Jan 6, 2026Updated 2 months ago
Alternatives and similar repositories for ANCE
Users that are interested in ANCE are comparing it to the libraries listed below
Sorting:
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,863Apr 6, 2023Updated 2 years ago
- An Open-Source Package for Information Retrieval.☆442Oct 7, 2022Updated 3 years ago
- SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.☆128Feb 15, 2022Updated 4 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago
- WSDM'2021, PROP and SIGIR'2021,B-PROP☆110May 18, 2023Updated 2 years ago
- Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"☆17Jan 10, 2022Updated 4 years ago
- NAACL2021 - COIL Contextualized Lexical Retriever☆157Jul 27, 2021Updated 4 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆265Jan 27, 2023Updated 3 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆734Jan 26, 2026Updated last month
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆60Jul 11, 2021Updated 4 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- ☆45Oct 14, 2021Updated 4 years ago
- RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddings…☆66Oct 13, 2021Updated 4 years ago
- docTTTTTquery document expansion model☆374Mar 25, 2023Updated 2 years ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,036Mar 9, 2026Updated last week
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- Build Text Rerankers with Deep Language Models☆265Feb 20, 2024Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆345Oct 10, 2023Updated 2 years ago
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 9 months ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,106Updated this week
- ☆15Aug 2, 2021Updated 4 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆339Jun 12, 2023Updated 2 years ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆773Apr 7, 2023Updated 2 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,107Oct 16, 2025Updated 5 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆606Jun 15, 2022Updated 3 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆325May 9, 2021Updated 4 years ago
- Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…☆110Apr 18, 2022Updated 3 years ago
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535☆146Feb 21, 2022Updated 4 years ago
- Dense hybrid representations for text retrieval☆64Apr 3, 2023Updated 2 years ago
- Multi-hop dense retrieval for question answering☆219Oct 12, 2021Updated 4 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆352Dec 21, 2023Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- Search Engines with Autoregressive Language models☆295Apr 4, 2023Updated 2 years ago
- Codebase for RetroMAE and beyond.☆272Jun 7, 2024Updated last year
- Library for Knowledge Intensive Language Tasks☆970Mar 31, 2022Updated 3 years ago
- Autoregressive Entity Retrieval☆796Jul 6, 2023Updated 2 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆132Jan 3, 2022Updated 4 years ago
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆337Jun 17, 2023Updated 2 years ago