unchartedsoftware / ensemble-clustering
Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that leverage the different semantic aspects of heterogeneous data. The library can be used on a single machine using multi-threading or distributed computing using Spark.
☆32Updated 10 years ago
Alternatives and similar repositories for ensemble-clustering
Users that are interested in ensemble-clustering are comparing it to the libraries listed below
Sorting:
- The main - so far, only - repository for the SmileWide project.☆32Updated 9 years ago
- ☆20Updated 8 years ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled wit…☆18Updated 14 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- ☆20Updated 8 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 9 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 8 years ago
- SNAP repository for Ringo☆14Updated 7 years ago
- DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿☆17Updated 8 years ago
- REx: Relation Extraction. Modernized re-write of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, a…☆22Updated 7 years ago
- ☆20Updated 7 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆97Updated 13 years ago
- Language Modeling with Sum-Product Networks☆20Updated 10 years ago
- Python functions for popular relevance metrics (ndcg, err, etc)☆16Updated last year
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…☆26Updated 4 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- Question Answering via Integer Programming (TableILP)☆28Updated 9 years ago