ybenoit / scikit-learn-to-spark-ml
Notebook comparing scikit-learn and Spark ML for building Machine Learning Pipelines
☆13 Updated 9 years ago
Related projects
Alternatives and complementary repositories for scikit-learn-to-spark-ml
- Large-scale topic discovery with Sampled-MinHashing ☆10 Updated 5 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge ☆21 Updated 7 years ago
- An Information Extraction Framework with Deep Learning developed at New York University ☆16 Updated 8 years ago
- Code for Criteo competition http://www.kaggle.com/c/criteo-display-ad-challenge ☆22 Updated 10 years ago
- ☆26 Updated 8 years ago
- Machine learning prediction of movie genres using Gensim's Doc2Vec and PyMongo (Python, MongoDB) ☆36 Updated last year
- Source code for exploring MLlib blog post ☆11 Updated 9 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow ☆13 Updated 8 years ago
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016) ☆13 Updated 7 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 10 years ago
- Sequential model-based optimization with a `scipy.optimize` interface ☆14 Updated 7 years ago
- Python code for training Paragram word embeddings. These achieve human-level performance on some word similarity tasks including SimLex-9… ☆30 Updated 8 years ago
- C++ neural network library ☆21 Updated 8 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi… ☆11 Updated 9 years ago
- ☆12 Updated 7 years ago
- ☆10 Updated 8 years ago
- Social Media and Text Analytics Course at UPenn ☆24 Updated last year
- Entity level sentiment analysis for product reviews using deep learning ☆55 Updated 8 years ago
- Low-rank Highway Networks ☆14 Updated 8 years ago
- ☆27 Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet ☆27 Updated 7 years ago
- Different approaches to computing document similarity ☆28 Updated 7 years ago
- A simple CNN implementation in Keras. ☆30 Updated 8 years ago
- Distributed LDA, takes raw text as input and outputs topic word table. ☆16 Updated 8 years ago
- Experiments on English Wikipedia. GloVe and word2vec. ☆13 Updated 8 years ago