pisa-engine / BMP
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆31 Updated 2 weeks ago
Alternatives and similar repositories for BMP
Users who are interested in BMP are comparing it to the repositories listed below.
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024 ☆62 Updated 9 months ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva… ☆74 Updated last week
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net ☆28 Updated 2 weeks ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆65 Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR. ☆28 Updated last year
- CLIR version of ColBERT ☆70 Updated 3 weeks ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆52 Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 Updated last year
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆57 Updated last week
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking ☆23 Updated 3 months ago
- A Python interface to PISA ☆36 Updated last month
- Model implementation for the contextual embeddings project ☆33 Updated last month
- ☆50 Updated 4 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. ☆80 Updated 5 months ago
- Tree-based indexes for neural-search ☆32 Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆29 Updated 6 months ago
- Collection of datasets for benchmarking filtered vector similarity retrieval ☆47 Updated last month
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 Updated last year
- [Official Codes] Synthetic Test Collections for Retrieval Evaluation (SIGIR 2024) ☆10 Updated 11 months ago
- ESPN: Embedding from Storage Pipelined Network. GDS implementation for multi-vector embedding retrieval and bindings. ☆11 Updated last year
- A list of multi-vector retrieval resources ☆13 Updated last year
- ☆86 Updated 3 months ago
- ☆10 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆137 Updated 2 months ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing ☆23 Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆17 Updated last month
- Retrieval-Augmented Generation battle! ☆52 Updated 6 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆21 Updated 2 weeks ago