unchartedsoftware / ensemble-clusteringLinks
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data.  The library can be used on a single machine using multi-threading or distributed computing using Spark.
☆32Updated 10 years ago
Alternatives and similar repositories for ensemble-clustering
Users that are interested in ensemble-clustering are comparing it to the libraries listed below
Sorting:
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning☆134Updated 4 years ago
- GPU Acceleration for Apache Spark☆34Updated 10 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A Java library for Stochastic Gradient Descent (SGD)☆22Updated 3 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 9 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆148Updated 3 years ago
- ☆20Updated 9 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 9 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 4 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- Deep Recurrent Neural Nets in Java☆53Updated 10 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 12 years ago
- An implementation of Long Short Term Memory in Java.☆29Updated 12 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- A recurrent neural network heavily inspired by Long Short Term Memory, but simpler.☆21Updated 12 years ago
- DBpedia.org RDF to CSV for import into Neo4j☆52Updated 10 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 10 years ago
- A set of hacks to setup a dbpedia endpoint through neo4j☆44Updated 12 years ago
- Java text categorization system☆57Updated 8 years ago
- Deep RNNs, LSTM networks and automatic differentiation package in Java☆10Updated 9 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- ☆20Updated 7 years ago