logicalguess / tf-idf-spark-and-python
TF-IDF with Spark for the Kaggle popcorn competition
☆10 Updated 10 years ago
Alternatives and similar repositories for tf-idf-spark-and-python
Users who are interested in tf-idf-spark-and-python are comparing it to the libraries listed below.
- ☆25 Updated 9 years ago
- Document or binary file vectorization with Normalized Compression Distance in Python. ☆17 Updated 9 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015. ☆23 Updated 10 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 11 years ago
- Active learning for Big Data ☆25 Updated 6 years ago
- Using Word2Vec on lists and sets ☆34 Updated last month
- Boosting and ensemble learning in Python. ☆54 Updated 10 years ago
- Predicting closed questions on Stack Overflow ☆44 Updated 7 years ago
- ☆44 Updated 9 years ago
- The code that I used in Click-Through Rate Prediction (http://www.kaggle.com/c/avazu-ctr-prediction/) (C++). It implements the Follow The… ☆12 Updated 10 years ago
- My winning solution for Kaggle Higgs Machine Learning Challenge (single classifier, xgboost) ☆82 Updated 10 years ago
- A helper library for data science pipeline ☆36 Updated 6 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 Updated 11 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt. ☆33 Updated 11 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content) ☆28 Updated 10 years ago
- Kaggle Otto Group Product Classification Challenge ☆35 Updated 10 years ago
- Run Nx2 Cross Validation for multiple binary classifiers in parallel with optional downsampling ☆13 Updated 10 years ago
- Amazon access control challenge ☆25 Updated 11 years ago
- Prize winning solution to the SeeClickFix contest hosted on Kaggle, developed by teammates Bryan Gregory and Miroslaw Horbal. The purpose… ☆26 Updated 11 years ago
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting ☆18 Updated 8 years ago
- How to use automatic polynomial features and neural network mode in VW ☆17 Updated 11 years ago
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle ☆36 Updated 10 years ago
- Code for the Kaggle Marinexplore challenge ☆17 Updated 12 years ago
- Repo for the Insults Detection challenge on Kaggle.com ☆11 Updated 12 years ago
- Various notebooks and tutorials on subjects of interest. ☆36 Updated 5 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016) ☆30 Updated 9 years ago
- Flask app to run a bandit algorithm testing different beer recommenders ☆25 Updated 11 years ago
- Predicting happiness from demographics and poll answers ☆45 Updated 8 years ago
- Second-ranked solution to the Kaggle "Flavours of Physics" competition ☆25 Updated 9 years ago