CLIR version of ColBERT
☆73 · Jun 23, 2025 · Updated 8 months ago
Alternatives and similar repositories for ColBERT-X
Users who are interested in ColBERT-X are comparing it to the libraries listed below.
- SIGIR 2023 tutorial on cross language information retrieval. ☆13 · Feb 28, 2024 · Updated 2 years ago
- Modular retrievers for zero-shot multilingual IR. ☆30 · Mar 6, 2024 · Updated last year
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆68 · Oct 21, 2025 · Updated 4 months ago
- ☆89 · Apr 3, 2025 · Updated 11 months ago
- A multilingual version of MS MARCO passage ranking dataset ☆147 · Oct 19, 2023 · Updated 2 years ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval ☆36 · Oct 18, 2024 · Updated last year
- ☆19 · May 16, 2024 · Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆61 · Jun 20, 2024 · Updated last year
- ☆13 · Nov 15, 2017 · Updated 8 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages. ☆199 · Jul 31, 2024 · Updated last year
- Benchmark datasets for sentiment analysis ☆12 · May 18, 2020 · Updated 5 years ago
- A project which does the ColBERT pruning based on the LP or L1 norm ☆19 · Jun 11, 2025 · Updated 8 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆33 · Nov 21, 2025 · Updated 3 months ago
- A list of advice on doing research that is useful for me :) ☆13 · Aug 17, 2019 · Updated 6 years ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024. ☆35 · Jan 14, 2026 · Updated last month
- ☆47 · Mar 27, 2022 · Updated 3 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆727 · Jan 26, 2026 · Updated last month
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva… ☆105 · Jan 27, 2026 · Updated last month
- A collection of Tantivy stemmer tokenizers ☆18 · Jun 27, 2024 · Updated last year
- ☆11 · Feb 9, 2024 · Updated 2 years ago
- ☆17 · Mar 22, 2025 · Updated 11 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,782 · Oct 14, 2025 · Updated 4 months ago
- Late Interaction Models Training & Retrieval ☆732 · Updated this week
- StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation ☆20 · Apr 24, 2025 · Updated 10 months ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025 ☆24 · Feb 6, 2026 · Updated 3 weeks ago
- Relative data structures based on the BWT ☆12 · Apr 28, 2018 · Updated 7 years ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆185 · May 3, 2025 · Updated 10 months ago
- Benchmarking library for RAG ☆260 · Feb 15, 2026 · Updated 2 weeks ago
- ☆28 · May 27, 2024 · Updated last year
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events. ☆43 · Nov 20, 2025 · Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆82 · Mar 18, 2024 · Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆52 · Jan 6, 2026 · Updated last month
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent ☆33 · Aug 20, 2025 · Updated 6 months ago
- Code for the Ask4Help project ☆22 · Nov 24, 2022 · Updated 3 years ago
- Model implementation for the contextual embeddings project ☆40 · Jun 2, 2025 · Updated 9 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain… ☆52 · May 19, 2023 · Updated 2 years ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆979 · May 3, 2024 · Updated last year
- SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search ☆23 · May 24, 2023 · Updated 2 years ago
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat… ☆103 · Nov 19, 2024 · Updated last year