hltcoe/ColBERT-X

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hltcoe/ColBERT-X)

hltcoe / ColBERT-X

CLIR version of ColBERT

☆73

Alternatives and similar repositories for ColBERT-X

Users that are interested in ColBERT-X are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hltcoe / patapsco
View on GitHub
Cross language information retrieval pipeline
☆19Jan 12, 2026Updated 6 months ago
ant-louis / xm-retrievers
View on GitHub
🌏 Modular retrievers for zero-shot multilingual IR.
☆30Mar 6, 2024Updated 2 years ago
CosimoRulli / emvb
View on GitHub
Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024
☆70Oct 21, 2025Updated 9 months ago
unicamp-dl / mMARCO
View on GitHub
A multilingual version of MS MARCO passage ranking dataset
☆148Oct 19, 2023Updated 2 years ago
Furyton / GR-as-MVDR
View on GitHub
[SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval
☆36Oct 18, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mjeensung / xtr-pytorch
View on GitHub
☆19May 16, 2024Updated 2 years ago
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742Updated this week
DeployQL / awesome-multi-vector
View on GitHub
A list of multi-vector retrieval resources
☆19May 29, 2024Updated 2 years ago
project-miracl / miracl
View on GitHub
A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
☆211Jul 31, 2024Updated last year
seanmacavaney / plaidrepro
View on GitHub
☆11Feb 9, 2024Updated 2 years ago
google-deepmind / xtr
View on GitHub
XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
☆64Jun 20, 2024Updated 2 years ago
terrierteam / pyterrier_colbert
View on GitHub
☆89Apr 3, 2025Updated last year
kyriemao / LeCoRE
View on GitHub
Code of LeCoRE
☆14Feb 15, 2023Updated 3 years ago
TusKANNy / seismic
View on GitHub
Official repository of the Seismic library.
☆135Jul 6, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DrJZhou / Datasets-for-Sentiment-Analysis
View on GitHub
Benchmark datasets for sentiment analysis
☆13May 18, 2020Updated 6 years ago
stanford-futuredata / ColBERT
View on GitHub
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,902Oct 14, 2025Updated 9 months ago
TREMA-UNH / rubric-grading-workbench
View on GitHub
A Workbench for Autograding Retrieve/Generate Systems
☆15Jun 30, 2025Updated last year
emory-irlab / pyterrier_genrank
View on GitHub
Generative Reranker PyTerrier
☆18Dec 1, 2025Updated 7 months ago
ChillingDream / DAP
View on GitHub
ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
☆24Aug 21, 2024Updated last year
xuanyuan14 / ARES
View on GitHub
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
☆23May 24, 2023Updated 3 years ago
vector-index-bench / vibe
View on GitHub
Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest neighbor search methods, or vector index…
☆44Updated this week
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆876Updated this week
stanford-futuredata / Baleen
View on GitHub
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)
☆48Dec 27, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
monologg / EncT5
View on GitHub
Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
☆62Jan 22, 2022Updated 4 years ago
pau-mensa / xtr-warp-rs
View on GitHub
High performance implementation of the WARP (SIGIR'25) retrieval engine.
☆36May 21, 2026Updated 2 months ago
searchhub / search-collector
View on GitHub
A fast and simple JavaScript library specifically targeted at collecting search and search related browser events.
☆43Nov 20, 2025Updated 8 months ago
naver / bergen
View on GitHub
Benchmarking library for RAG
☆275Jul 14, 2026Updated last week
naver / splade
View on GitHub
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999May 3, 2024Updated 2 years ago
jlscheerer / xtr-warp
View on GitHub
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆209May 3, 2025Updated last year
UKPLab / acl2024-dapr
View on GitHub
☆28May 27, 2024Updated 2 years ago
ielab / PromptReps
View on GitHub
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆52Jan 6, 2026Updated 6 months ago
jfkback / hypencoder-paper
View on GitHub
Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"
☆41Sep 20, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TusKANNy / awesome-multivector-retrieval
View on GitHub
An extensive and commented list of resources on Late-Interaction Multivector Retrieval.
☆68Updated this week
THU-KEG / Xlore2.0
View on GitHub
Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]
☆12Apr 5, 2017Updated 9 years ago
Hannibal046 / nanoColBERT
View on GitHub
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆83Mar 18, 2024Updated 2 years ago
mammothb / editdistpy
View on GitHub
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…
☆26Updated this week
tunib-ai / joker
View on GitHub
AI model designed to test the effectiveness in handling external ethical attacks.
☆11Feb 9, 2026Updated 5 months ago
jingtaozhan / JPQ
View on GitHub
CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
☆52Feb 19, 2022Updated 4 years ago
musabgultekin / functionary
View on GitHub
Chat language model that can interpret and execute functions/plugins
☆14Oct 16, 2024Updated last year