stanford-futuredata/ColBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stanford-futuredata/ColBERT)

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

☆3,899

Alternatives and similar repositories for ColBERT

Users that are interested in ColBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,937May 17, 2025Updated last year
castorini / pyserini
View on GitHub
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
☆2,098Updated this week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,152Updated this week
beir-cellar / beir
View on GitHub
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,237Oct 16, 2025Updated 9 months ago
naver / splade
View on GitHub
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999May 3, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742Jul 3, 2026Updated 2 weeks ago
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆874Updated this week
huggingface / sentence-transformers
View on GitHub
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,911Updated this week
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,623Dec 20, 2025Updated 6 months ago
facebookresearch / DPR
View on GitHub
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
☆1,867Apr 6, 2023Updated 3 years ago
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆779Apr 7, 2023Updated 3 years ago
zetaalphavector / InPars
View on GitHub
Inquisitive Parrots for Search
☆200Jun 5, 2025Updated last year
xhluca / bm25s
View on GitHub
Fast BM25 search in Python, powered by Numpy and Numba
☆1,735Jul 7, 2026Updated last week
castorini / docTTTTTquery
View on GitHub
docTTTTTquery document expansion model
☆377Mar 25, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆14,511Updated this week
stanfordnlp / ColBERT-QA
View on GitHub
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
☆39Aug 2, 2021Updated 4 years ago
xlang-ai / instructor-embedding
View on GitHub
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
☆2,024Jan 15, 2025Updated last year
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,943Apr 22, 2026Updated 2 months ago
IntelLabs / fastRAG
View on GitHub
Efficient Retrieval Augmentation and Generation Framework
☆1,785Jan 12, 2026Updated 6 months ago
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,772May 26, 2026Updated last month
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,200Updated this week
raphaelsty / neural-cherche
View on GitHub
Neural Search
☆371Mar 11, 2025Updated last year
567-labs / instructor
View on GitHub
structured outputs for llms
☆13,535Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vibrantlabsai / ragas
View on GitHub
Supercharge Your LLM Application Evaluations 🚀
☆14,850Feb 24, 2026Updated 4 months ago
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆50,868Updated this week
facebookresearch / dpr-scale
View on GitHub
Scalable training for dense retrieval models.
☆298Jul 2, 2026Updated 2 weeks ago
deepset-ai / haystack
View on GitHub
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…
☆25,903Updated this week
terrierteam / pyterrier_colbert
View on GitHub
☆89Apr 3, 2025Updated last year
facebookresearch / faiss
View on GitHub
A library for efficient similarity search and clustering of dense vectors.
☆40,517Updated this week
guidance-ai / guidance
View on GitHub
A guidance language for controlling large language models.
☆21,667May 21, 2026Updated last month
argilla-io / argilla
View on GitHub
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,036Updated this week
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,850Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
embeddings-benchmark / mteb
View on GitHub
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,358Updated this week
castorini / rank_llm
View on GitHub
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆609Jun 19, 2026Updated last month
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆86,341Updated this week
huggingface / text-generation-inference
View on GitHub
Large Language Model Text Generation Inference
☆10,872Mar 21, 2026Updated 3 months ago
ShishirPatil / gorilla
View on GitHub
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,948Apr 13, 2026Updated 3 months ago
allenai / ir_datasets
View on GitHub
Provides a common interface to many IR ranking datasets.
☆390May 28, 2026Updated last month
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,327Updated this week