unchartedsoftware / ensemble-clustering
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can run on a single machine using multi-threading or in a distributed setting using Spark.
☆32 Updated 10 years ago
Alternatives and similar repositories for ensemble-clustering
Users who are interested in ensemble-clustering are comparing it to the libraries listed below.
- ☆20 Updated 8 years ago
- The main - so far, only - repository for the SmileWide project. ☆32 Updated 9 years ago
- ☆20 Updated 8 years ago
- Pattern-of-Behavior Search Tool ☆11 Updated 3 years ago
- NLP Utilities in Java ☆43 Updated 2 years ago
- An implementation of locality sensitive hashing with Hadoop ☆57 Updated 10 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and … ☆48 Updated 3 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,... ☆34 Updated 6 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 Updated 9 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching. ☆40 Updated 10 years ago
- DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿 ☆17 Updated 8 years ago
- Vizlinc ☆15 Updated 9 years ago
- General Vectorization Lib for Machine Learning Tools ☆31 Updated 8 years ago
- How to spot first stories on Twitter using Storm. ☆125 Updated last year
- Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge" ☆11 Updated 10 years ago
- A Spark-based LexRank extractive summarizer for text documents ☆19 Updated 9 years ago
- A Java library for Stochastic Gradient Descent (SGD) ☆21 Updated 3 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance. ☆40 Updated 9 years ago
- Topic modeling with first-order logic (FOL) domain knowledge ☆33 Updated 13 years ago
- In-database parallel grid-search for XGBoost on Greenplum ☆15 Updated 7 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website: ☆17 Updated 9 years ago
- SmallK: very fast data clustering tools ☆14 Updated 6 years ago
- USC GoFFish Graph Analytics Framework ☆33 Updated 10 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene ☆58 Updated 12 years ago
- GPU Acceleration for Apache Spark ☆34 Updated 9 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework ☆42 Updated 12 years ago
- This toolkit consists of implementations of various graph-based semi-supervised learning (SSL) algorithms. Currently, three algorithms ar… ☆151 Updated 7 years ago
- Implicit relation extractor using a natural language model. ☆24 Updated 7 years ago
- Templates for projects based on top of H2O. ☆38 Updated 3 months ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs ☆59 Updated 7 years ago