unchartedsoftware / ensemble-clusteringLinks
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can be used on a single machine using multi-threading or distributed computing using Spark.
☆32Updated 10 years ago
Alternatives and similar repositories for ensemble-clustering
Users that are interested in ensemble-clustering are comparing it to the libraries listed below
Sorting:
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning☆134Updated 4 years ago
- ☆20Updated 8 years ago
- A Java library for Stochastic Gradient Descent (SGD)☆21Updated 3 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- Deep Recurrent Neural Nets in Java☆53Updated 9 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 11 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- NLP tools developed by Emory University.☆60Updated 9 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- ☆20Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- provides iSAX Java implementation☆14Updated 10 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- Implicit relation extractor using a natural language model.☆24Updated 7 years ago
- An implementation of Long Short Term Memory in Java.☆29Updated 12 years ago
- Deep RNNs, LSTM networks and automatic differentiation package in Java☆10Updated 9 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LD…☆27Updated 2 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- Java implementation of the TextRank algorithm by Mihalcea, et al. http://lit.csci.unt.edu/index.php/Graph-based_NLP☆29Updated 4 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- Machine Learning Tool Kit☆137Updated 4 years ago