A toolkit for building dense retrievers with deep language models.
☆64 · Sep 24, 2021 · Updated 4 years ago
Alternatives and similar repositories for Dense
Users who are interested in Dense are comparing it to the libraries listed below.
- SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search ☆23 · May 24, 2023 · Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval ☆256 · Mar 18, 2022 · Updated 4 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆734 · Jan 26, 2026 · Updated last month
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs" ☆10 · Dec 13, 2024 · Updated last year
- Build Text Rerankers with Deep Language Models ☆265 · Feb 20, 2024 · Updated 2 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024. ☆16 · Oct 28, 2024 · Updated last year
- NAACL2021 - COIL Contextualized Lexical Retriever ☆157 · Jul 27, 2021 · Updated 4 years ago
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval ☆98 · May 9, 2023 · Updated 2 years ago
- ☆19 · Jun 21, 2025 · Updated 9 months ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval ☆28 · Jun 7, 2023 · Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega… ☆18 · Mar 25, 2024 · Updated last year
- Train Dense Passage Retriever (DPR) with a single GPU ☆136 · Jun 16, 2021 · Updated 4 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 · Jun 30, 2025 · Updated 8 months ago
- Code to reproduce THUIR's submissions for COLIEE 2023 Task1 and Task2 ☆28 · May 12, 2023 · Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 9 months ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch ☆265 · Jan 27, 2023 · Updated 3 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models ☆17 · Jul 17, 2024 · Updated last year
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models" ☆19 · Dec 16, 2024 · Updated last year
- ☆24 · Oct 23, 2020 · Updated 5 years ago
- Generated geosite.dat based on Antifilter Community List ☆25 · Mar 15, 2026 · Updated last week
- 🌏 UI component library for the future, based on WebComponent. ☆23 · Nov 12, 2024 · Updated last year
- ☆32 · Aug 9, 2024 · Updated last year
- ☆30 · Sep 25, 2024 · Updated last year
- Code for AAAI 2024 paper Wikiformer ☆20 · Dec 21, 2023 · Updated 2 years ago
- ☆27 · Oct 6, 2024 · Updated last year
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain. ☆60 · May 17, 2023 · Updated 2 years ago
- An end-to-end neural ad-hoc ranking pipeline. ☆153 · Jul 13, 2025 · Updated 8 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆80 · Feb 16, 2022 · Updated 4 years ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval ☆120 · Aug 7, 2024 · Updated last year
- ☆45 · Mar 3, 2023 · Updated 3 years ago
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535 ☆146 · Feb 21, 2022 · Updated 4 years ago
- ☆14 · Oct 29, 2020 · Updated 5 years ago
- LAiW: A Chinese Legal Large Language Models Benchmark ☆89 · Jul 3, 2024 · Updated last year
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning" ☆29 · Oct 1, 2024 · Updated last year
- ☆70 · Jun 16, 2022 · Updated 3 years ago
- Scalable training for dense retrieval models. ☆298 · Jun 10, 2025 · Updated 9 months ago
- An Open-Source Package for Information Retrieval ☆168 · Mar 9, 2026 · Updated last week
- ☆44 · Oct 1, 2024 · Updated last year
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024. ☆31 · Apr 19, 2024 · Updated last year