☆96Jul 4, 2025Updated 9 months ago
Alternatives and similar repositories for fastkmeans
Users that are interested in fastkmeans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High-Performance Engine for Multi-Vector Search☆236Mar 25, 2026Updated 2 weeks ago
- Transform is the main building block of data pipelines in fastai. And elsewhere if you want.☆32Updated this week
- A simple python wrapper for using the Caddy API☆27Apr 4, 2026Updated last week
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- ☆55Jul 10, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- State-of-the-art paired encoder and decoder models (17M-1B params)☆65Aug 6, 2025Updated 8 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆189May 3, 2025Updated 11 months ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 4 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,605Dec 20, 2025Updated 3 months ago
- PyLate efficient inference engine☆81Jan 7, 2026Updated 3 months ago
- Late Interaction Models Training & Retrieval☆783Mar 6, 2026Updated last month
- An enterprise deep research benchmark☆35Mar 22, 2026Updated 3 weeks ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- ☆15Apr 26, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- decontamination☆30Mar 4, 2026Updated last month
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 10 months ago
- Robust Self-augmentation for NER with Meta-reweighting☆29Nov 8, 2022Updated 3 years ago
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiencyand Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25…☆24Dec 10, 2025Updated 4 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 8 months ago
- Bringing BERT into modernity via both architecture changes and scaling☆1,652Mar 1, 2026Updated last month
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- anything you want can be built with morph cloud☆27Oct 14, 2025Updated 5 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆336Updated this week
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆84Feb 10, 2026Updated 2 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 6 months ago
- Better Live Text for MacOS☆35Feb 8, 2026Updated 2 months ago
- NLP with Rust for Python 🦀🐍☆72May 13, 2025Updated 10 months ago
- Python library to use Pleias-RAG models☆71May 1, 2025Updated 11 months ago
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆54Jan 2, 2024Updated 2 years ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Fast, Modern, and Low Precision PyTorch Optimizers☆130Dec 29, 2025Updated 3 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆149Aug 9, 2024Updated last year
- The first dense retrieval model that can be prompted like an LM☆91May 8, 2025Updated 11 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆61Jun 20, 2024Updated last year
- Redesign of solar.lowtechmagazine.com in Hugo engine☆17Apr 26, 2025Updated 11 months ago
- Have UV deal with all your Jupyter deps.☆28Sep 7, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated 11 months ago