A toolkit for building dense retrievers with deep language models.
☆64Sep 24, 2021Updated 4 years ago
Alternatives and similar repositories for Dense
Users that are interested in Dense are comparing it to the libraries listed below
Sorting:
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 3 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆727Jan 26, 2026Updated last month
- Build Text Rerankers with Deep Language Models☆265Feb 20, 2024Updated 2 years ago
- Code for KERM: Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking, accepted at SIGIR 2022.☆19Oct 31, 2022Updated 3 years ago
- Butler 是一个用于自动化服务管理和任务调度的工具项目。☆16Updated this week
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- ☆19Jun 21, 2025Updated 8 months ago
- NAACL2021 - COIL Contextualized Lexical Retriever☆157Jul 27, 2021Updated 4 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆18Dec 16, 2024Updated last year
- Generated geosite.dat based on Antifilter Community List☆25Feb 22, 2026Updated last week
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- ☆32Aug 9, 2024Updated last year
- 🌏 UI component library for the future, based on WebComponent.☆23Nov 12, 2024Updated last year
- ☆24Oct 23, 2020Updated 5 years ago
- The code is for our AAAI2023 paper: Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs (Ding…☆10Dec 17, 2022Updated 3 years ago
- ☆44Mar 3, 2023Updated 2 years ago
- ☆27Oct 6, 2024Updated last year
- ☆10Apr 16, 2021Updated 4 years ago
- Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2☆28May 12, 2023Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Nov 7, 2017Updated 8 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆13Sep 1, 2025Updated 6 months ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆29Oct 1, 2024Updated last year
- ☆30Sep 25, 2024Updated last year
- An end-to-end neural ad-hoc ranking pipeline.☆152Jul 13, 2025Updated 7 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆30Apr 19, 2024Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆89Jul 3, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- ☆32Mar 31, 2020Updated 5 years ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60May 17, 2023Updated 2 years ago
- Accelerated Confergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535☆146Feb 21, 2022Updated 4 years ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- collecting publicly available distillation datasets based on DepSeek-R1☆26Mar 12, 2025Updated 11 months ago
- Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han…☆22Feb 4, 2026Updated 3 weeks ago
- Repo. for RLCF.☆15Apr 1, 2024Updated last year