tapanalyticstoolkit / spark-tk
Python and Scala APIs for enhanced Spark analytics
☆12Updated 8 years ago
Alternatives and similar repositories for spark-tk
Users that are interested in spark-tk are comparing it to the libraries listed below
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆16Updated 6 years ago
- Multinomial Factorization Machines☆21Updated 8 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 10 years ago
- Python integration for Gradientzoo - version and share your trained neural network model weights☆12Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
- Exploration Library in Java☆12Updated last year
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- The Python code of the book: Machine Learning for Spark☆8Updated 8 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialogue for under…☆10Updated 2 years ago
- Vector-free L-BFGS implementation on Spark☆9Updated 9 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.☆23Updated 7 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- Pure Java implementation of XGBoost predictor for online prediction tasks.☆27Updated 2 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 9 years ago
- ☆37Updated 6 years ago
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- ☆14Updated 8 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆20Updated 8 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆9Updated 4 years ago
- Classifying economics articles using Latent Dirichlet Allocation☆8Updated 8 years ago
- Mirror of Apache Spark☆11Updated last week
- VW, Liblinear and StreamSVM compared on webspam☆14Updated 10 years ago