lightonai / fast-plaidLinks
High-Performance Engine for Multi-Vector Search
☆162Updated 2 weeks ago
Alternatives and similar repositories for fast-plaid
Users that are interested in fast-plaid are comparing it to the libraries listed below
Sorting:
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆149Updated 2 months ago
- ☆77Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆189Updated last year
- Simple UI for debugging correlations of text embeddings☆292Updated 4 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆166Updated 5 months ago
- ☆159Updated 10 months ago
- ☆41Updated 3 months ago
- Late Interaction Models Training & Retrieval☆610Updated 2 weeks ago
- NLP with Rust for Python 🦀🐍☆65Updated 4 months ago
- Generalist and Lightweight Model for Text Classification☆162Updated 3 months ago
- A framework for optimizing DSPy programs with RL☆191Updated this week
- Crispy reranking models by Mixedbread☆36Updated 3 weeks ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last week
- ☆210Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last year
- ☆30Updated last year
- Python library to use Pleias-RAG models☆63Updated 5 months ago
- Pre-train Static Word Embeddings☆86Updated last month
- Tools to make language models a bit easier to use☆54Updated 2 weeks ago
- PyLate efficient inference engine☆65Updated 3 weeks ago
- A small library of LLM judges☆288Updated 2 months ago
- ☆68Updated 4 months ago
- Datamodels for hugging face tokenizers☆77Updated 2 weeks ago
- Efficient vector database for hundred millions of embeddings.☆208Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆32Updated last year
- Plug-and-play, zero-shot document processing pipelines.☆107Updated this week
- An introduction to LLM Sampling☆79Updated 9 months ago
- Inference-time scaling for LLMs-as-a-judge.☆300Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆334Updated last month