derrickburns / generalized-kmeans-clustering
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
☆304 Updated 3 weeks ago
Alternatives and similar repositories for generalized-kmeans-clustering
Users who are interested in generalized-kmeans-clustering are comparing it to the libraries listed below.
- ☆673 Updated 3 weeks ago
- Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems ☆105 Updated 3 months ago
- A BERT that you can train on a (gaming) laptop. ☆209 Updated last year
- Docker-based inference engine for AMD GPUs ☆231 Updated 9 months ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D ☆97 Updated last year
- Examples and guides for using the VLM Run API ☆283 Updated this week
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm ☆177 Updated 4 months ago
- R.L. methods and techniques. ☆199 Updated 8 months ago
- Optimally allocate poker chips using constrained, nonlinear optimization ☆174 Updated 7 months ago
- Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala. ☆150 Updated last year
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors. ☆396 Updated 4 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆286 Updated 2 weeks ago
- ☆51 Updated last year
- Lamport's Bakery Algorithm Demonstrated in Python ☆96 Updated last year
- Simplifying robust end-to-end machine learning on Apache Spark. ☆472 Updated 8 years ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆253 Updated last year
- ☆278 Updated last month
- Hybrid search engine, combining best features of text and semantic search worlds ☆483 Updated last week
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio… ☆109 Updated last year
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆63 Updated 9 years ago
- Run and explore Llama models locally with minimal dependencies on CPU ☆191 Updated 9 months ago
- convert a scikit-learn decision tree into a Keras model ☆39 Updated last year
- Grow virtual creatures in static and physics simulated environments. ☆53 Updated last year
- Self-contained worked examples of Apache Lucene features and functionality ☆204 Updated last week
- A hub for various industry-specific schemas to be used with VLMs. ☆525 Updated last month
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆278 Updated 3 weeks ago
- Dead Simple LLM Abliteration ☆224 Updated 5 months ago
- Automated, smooth, N'th order derivatives of non-uniformly sampled time series data ☆227 Updated 8 months ago
- Finds the school district associated with a given street address in the United States ☆50 Updated 11 months ago
- Algebraic enhancements for GEMM & AI accelerators ☆277 Updated 4 months ago