A toolkit for building dense retrievers with deep language models.
☆64Sep 24, 2021Updated 4 years ago
Alternatives and similar repositories for Dense
Users that are interested in Dense are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Aug 10, 2023Updated 2 years ago
- SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search☆23May 24, 2023Updated 2 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆733Jan 26, 2026Updated 2 months ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Build Text Rerankers with Deep Language Models☆265Feb 20, 2024Updated 2 years ago
- Code for KERM: Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking, accepted at SIGIR 2022.☆19Oct 31, 2022Updated 3 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Butler 是一个用于自动化服务管理和任务调度的工具项目。☆16Updated this week
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆99May 9, 2023Updated 2 years ago
- NAACL2021 - COIL Contextualized Lexical Retriever☆157Jul 27, 2021Updated 4 years ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- Train Dense Passage Retriever (DPR) with a single GPU☆136Jun 16, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 9 months ago
- Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2☆28May 12, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 10 months ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆265Jan 27, 2023Updated 3 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆19Dec 16, 2024Updated last year
- ☆24Oct 23, 2020Updated 5 years ago
- Generated geosite.dat based on Antifilter Community List☆25Apr 5, 2026Updated last week
- 🌏 UI component library for the future, based on WebComponent.☆23Nov 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆32Aug 9, 2024Updated last year
- ☆30Sep 25, 2024Updated last year
- Code for AAAI 2024 paper Wikiformer☆20Dec 21, 2023Updated 2 years ago
- ☆27Oct 6, 2024Updated last year
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60May 17, 2023Updated 2 years ago
- An end-to-end neural ad-hoc ranking pipeline.☆153Jul 13, 2025Updated 8 months ago
- The code is for our AAAI2023 paper: Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs (Ding…☆10Dec 17, 2022Updated 3 years ago
- TransformerDB☆19Apr 22, 2021Updated 4 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆119Aug 7, 2024Updated last year
- ☆45Mar 3, 2023Updated 3 years ago
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535☆147Feb 21, 2022Updated 4 years ago
- ☆14Oct 29, 2020Updated 5 years ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆90Jul 3, 2024Updated last year
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆30Oct 1, 2024Updated last year
- Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han…☆22Mar 24, 2026Updated 2 weeks ago