logicalguess / tf-idf-spark-and-python
TF-IDF with Spark for the Kaggle popcorn competition
☆10Updated 9 years ago
Alternatives and similar repositories for tf-idf-spark-and-python:
Users that are interested in tf-idf-spark-and-python are comparing it to the libraries listed below
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- ☆25Updated 8 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Updated 10 years ago
- Document or binary file vectorization with Normalized Compression Distance in Python.☆17Updated 9 years ago
- A board game recommendation engine/model/website.☆39Updated 8 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Amazon access control challenge☆25Updated 10 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆19Updated 7 years ago
- Flask app to run a bandit algorithm testing different beer recommenders☆25Updated 11 years ago
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting☆18Updated 8 years ago
- Boosting and ensemble learning in Python.☆54Updated 10 years ago
- Large scale matrix factorization on GPU☆19Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 11 years ago
- A helper library for data science pipeline☆36Updated 5 years ago
- Code for KDD 2014☆16Updated 9 years ago
- Code for the Kaggle Marinexplore challenge☆17Updated 12 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Topic analysis using RSM or PVDM.☆11Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- ☆26Updated 9 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 10 years ago
- ☆44Updated 9 years ago
- Various notebooks and tutorials on subjects of interest.☆36Updated 4 years ago
- Gaussian Process Factorization Machines for Context-aware Recommendations☆42Updated 9 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- Scikit-learn quickstart tutorial for Webstep☆19Updated 7 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt.☆33Updated 10 years ago
- ☆8Updated 10 years ago
- GSOC 2017 - Apache Organization - # Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python…☆14Updated 8 years ago