AnswerDotAI / fastkmeansLinks
β64Updated last month
Alternatives and similar repositories for fastkmeans
Users that are interested in fastkmeans are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python π¦πβ64Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated last year
- β49Updated 6 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β63Updated last week
- An introduction to LLM Samplingβ79Updated 8 months ago
- Pre-train Static Word Embeddingsβ85Updated 2 months ago
- High-Performance Engine for Multi-Vector Searchβ146Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β143Updated last month
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- PyLate efficient inference engineβ62Updated last month
- Crispy reranking models by Mixedbreadβ34Updated last month
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β188Updated 11 months ago
- β56Updated 3 months ago
- Python library to use Pleias-RAG modelsβ61Updated 3 months ago
- β80Updated 2 months ago
- code for training & evaluating Contextual Document Embedding modelsβ197Updated 3 months ago
- β31Updated 9 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β83Updated 2 weeks ago
- Generalist and Lightweight Model for Text Classificationβ155Updated 2 months ago
- Supercharge huggingface transformers with model parallelism.β77Updated last month
- Tools to make language models a bit easier to useβ48Updated last month
- β38Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ55Updated 6 months ago
- PyTorch implementation for MRLβ19Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.β154Updated 3 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β66Updated last month
- β133Updated last week
- β155Updated 8 months ago
- β210Updated last month
- β54Updated 9 months ago