ybenoit / scikit-learn-to-spark-ml
Notebook comparing scikit-learn and Spark ML for building Machine Learning Pipelines
☆13Updated 9 years ago
Alternatives and similar repositories for scikit-learn-to-spark-ml:
Users that are interested in scikit-learn-to-spark-ml are comparing it to the libraries listed below
- Large-scale topic discovery with Sampled-MinHashing☆10Updated 5 years ago
- Classifying economics articles using Latent Dirichlet Allocation☆8Updated 8 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- ☆16Updated 6 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 7 years ago
- An Information Extraction Framework with Deep Learning developed at New York University☆15Updated 8 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- ☆10Updated 9 years ago
- Social Media and Text Analytics Course at UPenn☆24Updated last year
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆36Updated 2 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- Python code for training Paragram word embeddings. These achieve human-level performance on some word similiarty tasks including SimLex-9…☆30Updated 9 years ago
- ☆25Updated 8 years ago
- Different approaches to computing document similarity☆28Updated 8 years ago
- A Keras model that addresses the Quora Question Pairs dyadic prediction task.☆14Updated 8 years ago
- Inspired by the neural style algorithm in the computer vision field, we propose a high-level language model with the aim of adapting the …☆19Updated 2 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 9 years ago
- tag doc using topN words with lda☆10Updated 9 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi …☆11Updated 9 years ago
- ☆13Updated 7 years ago
- ☆29Updated 9 years ago
- Sequential model-based optimization with a `scipy.optimize` interface☆15Updated 7 years ago
- ☆26Updated 7 years ago
- Talk on "Bayesian optimisation", beginner level☆25Updated 8 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 9 years ago
- A simple CNN implementation in Keras.☆30Updated 8 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Code for Criteo competition http://www.kaggle.com/c/criteo-display-ad-challenge☆22Updated 10 years ago
- Neural Network Models for Multi-label learning☆17Updated 4 years ago
- Predicting sales with Pandas☆15Updated 9 years ago