⚡ Super fast clustering for high-dimensional vectors on CPUs (x86, ARM) and GPUs — for Python and C++. 100x faster clustering of vector embeddings than FAISS
☆50Mar 26, 2026Updated this week
Alternatives and similar repositories for SuperKMeans
Users that are interested in SuperKMeans are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Artifact Evaluation for SpecFS [FAST'26]☆29Dec 28, 2025Updated 2 months ago
- High-Performance K-Means Clustering Library☆41Jul 6, 2025Updated 8 months ago
- Regional Ocean Forecasting with Hierarchical Graph Neural Networks☆20Aug 7, 2025Updated 7 months ago
- GenDB, an LLM-Powered Generative Query Engine Built for the Future☆51Updated this week
- ☆21Mar 15, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [VLDB'22] Source code for the paper: A Cache-Aware Learned Index with a Cost-based Construction Algorithm.☆10Jan 3, 2022Updated 4 years ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆35Jan 14, 2026Updated 2 months ago
- [SIGMOD’24] Source code for the paper: Making In-Memory Learned Indexes Efficient on Disk☆13Jun 28, 2024Updated last year
- Software engineering lab☆32Feb 16, 2020Updated 6 years ago
- ☆15Apr 14, 2023Updated 2 years ago
- [SIGMOD'25] We show the data chunk compaction problem in vectorized execution, and propose practical compaction solutions.☆14Oct 10, 2025Updated 5 months ago
- [SIGMOD'25] Source code for the paper: Debunking the Myth of Join Ordering: Toward Robust SQL Analytics☆21Sep 4, 2025Updated 6 months ago
- Visual Transformer Mechanistic Analysis Tool☆36Jun 3, 2023Updated 2 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A lightweight library that implements state-of-the-art few-shot learning algorithms.☆24Apr 18, 2021Updated 4 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Mar 14, 2026Updated last week
- Reducing the cache misses of SIMD vectorization using IMV☆29Jun 29, 2022Updated 3 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 6 months ago
- ☆25Sep 11, 2023Updated 2 years ago
- Better Live Text for MacOS☆33Feb 8, 2026Updated last month
- Multi-AI adversarial PR review tool☆68Mar 16, 2026Updated last week
- compare elastic and clickhouse☆24May 6, 2021Updated 4 years ago
- ☆14Jul 7, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆31Aug 16, 2019Updated 6 years ago
- Approximate Nearest Neighbor search using reduced-rank regression, with extremely fast queries, tiny memory usage, and rapid indexing on …☆51Dec 11, 2025Updated 3 months ago
- ☆12Jun 5, 2025Updated 9 months ago
- AlphaFlow Reinforcement Learning☆10Apr 13, 2023Updated 2 years ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025☆24Mar 11, 2026Updated 2 weeks ago
- SQLStorm: Taking Database Benchmarking into the LLM Era☆78Jan 2, 2026Updated 2 months ago
- Deep learning models for contextual multi-armed bandit setting☆13May 16, 2021Updated 4 years ago
- A BM25 embedder, scorer, and search engine, written in Rust.☆57Mar 9, 2026Updated 2 weeks ago
- ☆31Dec 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [VLDB'24] Blitzcrank is to compress in-memory, OLTP databases. It introduces a new entropy coding algorithm named Delayed Coding.☆39Sep 20, 2024Updated last year
- An extension to Spotipy for anonymous access to the Spotify Web API☆12Nov 7, 2025Updated 4 months ago
- ☆20Nov 23, 2022Updated 3 years ago
- Oracle NoSQL Database. Designed for today’s most demanding applications that require low latency responses, flexible data models, and e…☆51Mar 2, 2026Updated 3 weeks ago
- A polyfill for the ECMAScript proposal “Set Methods for JavaScript”☆14Aug 28, 2023Updated 2 years ago
- SQL-ProcBench is an open benchmark for procedural workloads in RDBMSs.☆48Sep 25, 2021Updated 4 years ago
- ☆23Jun 18, 2024Updated last year