lightonai / pylateLinks

Late Interaction Models Training & Retrieval

☆511

Alternatives and similar repositories for pylate

Users that are interested in pylate are comparing it to the libraries listed below

Sorting:

mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142Updated 2 weeks ago
IBM / fastfit
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
☆210Updated 2 months ago
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆186Updated 11 months ago
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆433Updated last year
raphaelsty / neural-cherche
Neural Search
☆362Updated 4 months ago
xhluca / bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
☆1,258Updated last month
AnswerDotAI / fastdata
☆154Updated 8 months ago
MinishLab / semhash
Fast Semantic Text Deduplication & Filtering
☆774Updated 2 months ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆148Updated last month
lightonai / fast-plaid
High-Performance Engine for Multi-Vector Search
☆130Updated last month
flairNLP / transformer-ranker
Efficiently find the best-suited language model (LM) for your NLP task
☆125Updated last week
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆288Updated 2 months ago
huggingface / text-clustering
Easily embed, cluster and semantically label text datasets
☆558Updated last year
cohere-ai / DiskVectorIndex
☆210Updated last month
NVIDIA / logits-processor-zoo
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆327Updated 3 weeks ago
MoritzLaurer / zeroshot-classifier
Notebooks for training universal 0-shot classifiers on many different tasks
☆133Updated 7 months ago
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆284Updated 4 months ago
AmenRa / ranx
⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
☆579Updated last week
mixedbread-ai / maxsim-cpu
☆35Updated 3 weeks ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆291Updated 3 weeks ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆195Updated 2 months ago
microsoft / MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
☆340Updated 7 months ago
castorini / rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆506Updated last week
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆317Updated 2 months ago
jackboyla / GLiREL
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
☆229Updated last month
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆249Updated 5 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆133Updated this week
huggingface / data-is-better-together
Let's build better datasets, together!
☆260Updated 7 months ago
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆222Updated 2 weeks ago