unchartedsoftware / ensemble-clusteringLinks
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can be used on a single machine using multi-threading or distributed computing using Spark.
☆32Updated 10 years ago
Alternatives and similar repositories for ensemble-clustering
Users that are interested in ensemble-clustering are comparing it to the libraries listed below
Sorting:
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆35Updated 6 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Deep Recurrent Neural Nets in Java☆53Updated 10 years ago
- ☆20Updated 9 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning☆134Updated 4 years ago
- A Java library for Stochastic Gradient Descent (SGD)☆22Updated 4 years ago
- Very Fast Machine Learning Toolkit☆28Updated 12 years ago
- A recurrent neural network heavily inspired by Long Short Term Memory, but simpler.☆21Updated 12 years ago
- An implementation of Long Short Term Memory in Java.☆29Updated 12 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 4 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 9 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- Deep RNNs, LSTM networks and automatic differentiation package in Java☆10Updated 10 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆57Updated 13 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 12 years ago
- provides iSAX Java implementation☆14Updated 10 years ago
- Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LD…☆28Updated 2 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…☆27Updated 4 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 6 years ago
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It…☆202Updated 5 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 10 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆148Updated 4 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 8 years ago
- ☆20Updated 8 years ago
- Java text categorization system☆57Updated 8 years ago