pavelchristof / template-scala-parallel-word2vec
PredictionIO word2vec engine template (Scala-based parallelized engine)
☆12 Updated 10 years ago
Alternatives and similar repositories for template-scala-parallel-word2vec
Users who are interested in template-scala-parallel-word2vec are comparing it to the libraries listed below.
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp… ☆61 Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation ☆33 Updated 5 years ago
- A Scala wrapper for CoreNLP ☆40 Updated 9 years ago
- PredictionIO Classification Engine Template (Scala-based parallelized engine) ☆39 Updated 6 years ago
- scalding powered machine learning ☆109 Updated 10 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 Updated 12 years ago
- An example project for doing grid search in MLlib ☆13 Updated 10 years ago
- Using deep learning to POS tag sentences using scala + DL4J ☆37 Updated 10 years ago
- Machine Learning for Cascading ☆82 Updated 10 years ago
- An Apache Spark-shell backend for IPython ☆105 Updated 3 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.) ☆43 Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆93 Updated 9 years ago
- Text Classification Engine ☆36 Updated 6 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird ☆27 Updated 6 years ago
- People. Places. Things. Graphs. ☆92 Updated 10 years ago
- This is an introduction to Apache Spark DataFrames. ☆41 Updated 10 years ago
- Gaussian Mixture Model Implementation in Pyspark ☆32 Updated 10 years ago
- Semanticizest: dump parser and client ☆20 Updated 9 years ago
- Library for building reproducible data pipelines to support experimentation ☆20 Updated 9 years ago
- An Apache Lucene TokenFilter that uses word2vec vectors for term expansion. ☆24 Updated 11 years ago
- Traptor -- A distributed Twitter feed ☆26 Updated 2 years ago
- Templates for projects based on top of H2O. ☆38 Updated 3 months ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services. ☆25 Updated 10 years ago
- Vizlinc ☆15 Updated 9 years ago
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner ☆41 Updated 2 years ago
- Movie recommendations and more in MapReduce and Scalding ☆117 Updated 12 years ago
- Easy distributed TensorFlow on Hadoop (moved to: hops-tensorflow) ☆9 Updated 8 years ago
- A toolkit to write UIMA components and applications ☆23 Updated 8 years ago