thongnt99/learned-sparse-retrieval

thongnt99 / learned-sparse-retrieval

Unified Learned Sparse Retrieval Framework

☆68

Alternatives and similar repositories for learned-sparse-retrieval

Users that are interested in learned-sparse-retrieval are comparing it to the libraries listed below.

thakur-nandan / sprint
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
☆48 · Jul 25, 2023 · Updated 3 years ago
ielab / asyncval
A toolkit for asynchronously validating dense retriever checkpoints during training.
☆27 · Aug 10, 2023 · Updated 2 years ago
ielab / TILDE
☆39 · Nov 21, 2022 · Updated 3 years ago
tira-io / ir-experiment-platform
☆31 · Sep 25, 2024 · Updated last year
nreimers / beir-sparta
Re-Implementation of SPARTA model
☆13 · Oct 1, 2021 · Updated 4 years ago
luyug / COIL
NAACL2021 - COIL Contextualized Lexical Retriever
☆158 · Jul 27, 2021 · Updated 4 years ago
allenai / ir_datasets
Provides a common interface to many IR ranking datasets.
☆390 · May 28, 2026 · Updated last month
capreolus-ir / capreolus
A toolkit for end-to-end neural ad hoc retrieval
☆98 · Aug 20, 2024 · Updated last year
thongnt99 / lsr-multimodal
ECIR 2024: Sparse lexical representation for image-text retrieval
☆13 · Jul 8, 2024 · Updated 2 years ago
HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22 · Mar 31, 2025 · Updated last year
DI4IR / SIGIR2021
☆24 · Jun 28, 2023 · Updated 3 years ago
TusKANNy / seismic
Official repository of the Seismic library.
☆135 · Jul 6, 2026 · Updated 2 weeks ago
sebastian-hofstaetter / matchmaker
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
☆265 · Jan 27, 2023 · Updated 3 years ago
psm1206 / MAWU
[CIKM'23] "Toward a Better Understanding of Loss Functions for Collaborative Filtering"
☆19 · Jan 23, 2024 · Updated 2 years ago
terrierteam / pyterrier_adaptive
☆18 · Jun 16, 2026 · Updated last month
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Updated this week
eunseongc / SpaDE
This is the official implementation of SpaDE. (CIKM'22)
☆22 · Aug 8, 2023 · Updated 2 years ago
skleee / GLEN
This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".
☆29 · Aug 25, 2025 · Updated 10 months ago
cmacdonald / pyt_splade
☆15 · Jun 26, 2026 · Updated 3 weeks ago
ict-bigdatalab / CorpusBrain
CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks
☆34 · Aug 31, 2022 · Updated 3 years ago
drogozhang / LED
Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)
☆22 · Aug 28, 2023 · Updated 2 years ago
algoprog / Faspect
A library for open domain query facet extraction and generation
☆16 · Apr 24, 2024 · Updated 2 years ago
sebastian-hofstaetter / colberter
☆47 · Mar 27, 2022 · Updated 4 years ago
joaopalotti / trectools
A simple toolkit to process TREC files in Python.
☆174 · Aug 24, 2024 · Updated last year
OpenMatch / ANCE-Tele
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18 · Mar 25, 2024 · Updated 2 years ago
skleee / GRUT
This is the official code for the EMNLP findings 2025 paper "Enhancing Time Awareness in Generative Recommendation".
☆19 · May 24, 2026 · Updated 2 months ago
zetaalphavector / InPars
Inquisitive Parrots for Search
☆200 · Jun 5, 2025 · Updated last year
naver / splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999 · May 3, 2024 · Updated 2 years ago
eunseongc / CARE
Official repository for "Conflict-Aware Soft Prompting for Retrieval-Augmented Generation" (EMNLP 2025)
☆20 · Nov 13, 2025 · Updated 8 months ago
luyug / Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
☆256 · Mar 18, 2022 · Updated 4 years ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆25 · Jun 30, 2025 · Updated last year
sudokim / 4th-Bookathon-The-Unbearable-Heaviness-of-GPT
4th AI × Bookathon Excellence Award
☆14 · Jan 20, 2023 · Updated 3 years ago
zhichaoxu-shufe / RankMamba
☆18 · Mar 30, 2024 · Updated 2 years ago
canjiali / PARADE
Code and data to facilitate BERT/ELECTRA for document ranking. For details, refer to the paper - PARADE: Passage Representation Aggregation for…
☆96 · Mar 25, 2023 · Updated 3 years ago
hltcoe / patapsco
Cross language information retrieval pipeline
☆19 · Jan 12, 2026 · Updated 6 months ago
sebastian-hofstaetter / neural-ranking-kd
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
☆117 · Jul 11, 2021 · Updated 5 years ago
sebastian-hofstaetter / tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60 · Jul 11, 2021 · Updated 5 years ago
luyug / Reranker
Build Text Rerankers with Deep Language Models
☆265 · Feb 20, 2024 · Updated 2 years ago
webis-de / rank-distillm
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking
☆25 · Apr 4, 2025 · Updated last year