pavelchristof / template-scala-parallel-word2vec
PredictionIO word2vec engine template (Scala-based parallelized engine)
☆12Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for template-scala-parallel-word2vec
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- PredictionIO Classification Engine Template (Scala-based parallelized engine)☆39Updated 5 years ago
- Templates for projects based on top of H2O.☆37Updated 2 weeks ago
- Machine Learning for Cascading☆82Updated 9 years ago
- Text Classification Engine☆36Updated 5 years ago
- Easy distributed TensorFlow on Hadoop (moved to: hops-tensorflow)☆9Updated 7 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Updated 8 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- An example project for doing grid search in MLlib☆13Updated 9 years ago
- A Scala wrapper for CoreNLP☆40Updated 8 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 11 years ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- scalding powered machine learning☆109Updated 9 years ago
- ☆110Updated 7 years ago
- Scala client for the Lightning data visualization server (WIP)☆47Updated 5 years ago
- Seldon Spark Jobs☆26Updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 8 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated last month
- Gaussian Mixture Model Implementation in Pyspark☆32Updated 9 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 6 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 6 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆91Updated 8 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago