logicalguess / tf-idf-spark-and-python
TF-IDF with Spark for the Kaggle popcorn competition
☆10Updated 9 years ago
Alternatives and similar repositories for tf-idf-spark-and-python
Users that are interested in tf-idf-spark-and-python are comparing it to the libraries listed below
Sorting:
- ☆25Updated 9 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Updated 10 years ago
- ☆26Updated 9 years ago
- Document or binary file vectorization with Normalized Compression Distance in Python.☆17Updated 9 years ago
- A board game recommendation engine/model/website.☆39Updated 8 years ago
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle☆36Updated 9 years ago
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting☆18Updated 8 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- Active learning for Big Data☆25Updated 6 years ago
- ☆44Updated 9 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 10 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- Various notebooks and tutorials on subjects of interest.☆36Updated 4 years ago
- Second-ranked solution to the Kaggle "Flavours of Physics" competition☆25Updated 9 years ago
- Reimplementation of deepwalk algorithm from https://github.com/phanein/deepwalk☆38Updated 9 years ago
- Scikit-learn API toy wrapper for Regularized Greedy Forests☆44Updated 9 years ago
- How to use automatic polynomial features and neural network mode in VW☆17Updated 10 years ago
- ☆49Updated 7 years ago
- Kaggle Otto Group Product Classification Challenge☆35Updated 9 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Flask app to run a bandit algorithm testing different beer recommenders☆25Updated 11 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- 3rd place solution for the Avito Context contest on kaggle.com☆29Updated 9 years ago
- Amazon access control challenge☆25Updated 10 years ago
- Scikit-learn quickstart tutorial for Webstep☆19Updated 8 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Topic analysis using RSM or PVDM.☆11Updated 10 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago