pisa-engine / BMP
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆31Updated 3 weeks ago
Alternatives and similar repositories for BMP
Users who are interested in BMP are comparing it to the libraries listed below:
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆64Updated 10 months ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆80Updated last month
- A list of multi-vector retrieval resources☆14Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated last year
- A Python interface to PISA☆36Updated 2 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- ☆53Updated last month
- Collection of datasets for benchmarking filtered vector similarity retrieval☆48Updated 2 months ago
- CLIR version of ColBERT☆72Updated 2 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆59Updated 2 weeks ago
- Code for fine-tuning neural sparse models and training them from scratch. #SIGIR2025☆14Updated 2 weeks ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆28Updated last year
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆54Updated 2 weeks ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆34Updated last month
- Model implementation for the contextual embeddings project☆35Updated 2 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆154Updated 3 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆53Updated last year
- A holistic framework to construct realistic evaluation datasets☆23Updated 2 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆23Updated 4 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆82Updated 7 months ago
- ☆12Updated 7 years ago
- ☆46Updated 3 years ago
- Efficient BM25 with DuckDB 🦆☆55Updated 8 months ago
- HNSW implemented in Python☆20Updated 5 years ago
- Vector Database with support for late interaction and token level embeddings.☆55Updated 2 months ago
- HNSW implemented in Python☆69Updated 6 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆29Updated 5 months ago
- Graph Library for Approximate Similarity Search☆129Updated last month
- DESSERT Efficiently Searches Sets of Embeddings via Retrieval Tables☆16Updated last year
- ☆12Updated 7 months ago