pisa-engine / BMP
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆31 · Updated last month
Alternatives and similar repositories for BMP
Users who are interested in BMP are comparing it to the libraries listed below.
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆66 · Updated 3 weeks ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva… ☆98 · Updated 3 weeks ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆65 · Updated 2 years ago
- A Python interface to PISA ☆37 · Updated last month
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Updated 2 years ago
- CLIR version of ColBERT ☆74 · Updated 4 months ago
- A list of multi-vector retrieval resources ☆14 · Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆58 · Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR. ☆28 · Updated last year
- ☆52 · Updated 3 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆59 · Updated last week
- Model implementation for the contextual embeddings project ☆36 · Updated 5 months ago
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net ☆36 · Updated 4 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆169 · Updated 6 months ago
- Collection of datasets for benchmarking filtered vector similarity retrieval ☆54 · Updated 5 months ago
- Retrieval-Augmented Generation battle! ☆59 · Updated 3 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · Updated 5 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆49 · Updated 2 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆37 · Updated last month
- ☆46 · Updated 3 years ago
- ☆13 · Updated 8 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆28 · Updated 5 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking ☆25 · Updated 7 months ago
- Vector Database with support for late interaction and token level embeddings. ☆55 · Updated 4 months ago
- This repository helps you evaluate your models on the FreshStack benchmark! ☆29 · Updated last week
- NLP with Rust for Python 🦀🐍 ☆66 · Updated 6 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. ☆82 · Updated 9 months ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025 ☆16 · Updated this week
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ☆31 · Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 · Updated 2 years ago