tapanalyticstoolkit / spark-tk
Python and Scala APIs for enhanced Spark analytics
☆12, updated 8 years ago
Alternatives and similar repositories for spark-tk
Users who are interested in spark-tk are comparing it to the libraries listed below.
- Another, hopefully better, implementation of ALS on Spark ☆14, updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15, updated 9 years ago
- Distributed implementation of Robust PLSA using Spark ☆12, updated 4 years ago
- Regularized latent variable mixed membership modeling ☆13, updated 12 years ago
- A neural network library trained on Spark RDD instances. ☆23, updated 9 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH ☆30, updated 6 years ago
- GCN implementation on top of Apache Spark ☆16, updated 2 years ago
- Java port of the C++ version of Facebook's fastText ☆13, updated 7 years ago
- Query Spark in real time and visualise the results as a graph. ☆24, updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31, updated 9 years ago
- ☆14, updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆44, updated 9 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.) ☆43, updated 8 years ago
- Keyword extraction package for Spark. ☆12, updated 8 years ago
- ☆37, updated 7 years ago
- How to use LSTM trained in Keras in your Java project. ☆29, updated 9 years ago
- VW, Liblinear and StreamSVM compared on webspam ☆14, updated 11 years ago
- A library of machine learning algorithms implemented using principles of functional programming. ☆23, updated 8 years ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm. ☆23, updated 8 years ago
- Spark Parameter Optimization and Tuning ☆31, updated 7 years ago
- Examples + Visualizations of datasets modeled using automl-gs ☆16, updated 6 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial ☆30, updated 10 years ago
- NLP Utilities in Java ☆43, updated 2 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51, updated 11 years ago
- Distributed Matrix Library ☆72, updated 8 years ago
- Topic Modeling on Apache Spark ☆94, updated 6 years ago
- ☆20, updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data ☆51, updated 8 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text. ☆58, updated 7 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow ☆12, updated 9 years ago