zygmuntz / phraug
A set of simple Python scripts for pre-processing large files
☆272Updated 11 months ago
Alternatives and similar repositories for phraug:
Users that are interested in phraug are comparing it to the libraries listed below
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆206Updated 6 years ago
- A Python wrapper for the libffm library.☆243Updated 6 years ago
- fast_tffm: Tensorflow-based Distributed Factorization Machine☆143Updated 8 years ago
- ☆108Updated 7 years ago
- ☆77Updated 8 years ago
- Winning solution to the Avito CTR competition☆137Updated 9 years ago
- Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)☆229Updated 8 years ago
- Code for the 3rd place finish for Avazu Click-Through Rate Prediction☆87Updated 10 years ago
- Kaggle 'Search Results Relevance' 2nd place solution☆79Updated 9 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge☆98Updated 10 years ago
- Distributed Factorization Machines☆297Updated 9 years ago
- Criteo/Kaggle Competition of CTR prediction☆130Updated 10 years ago
- ☆447Updated 7 months ago
- Some small utility modules to help with pandas, numpy and sklearn usage in other projects☆183Updated 2 years ago
- Distributed Deep Learning on Spark☆402Updated 8 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- ☆190Updated last year
- Top15 Solution for Kaggle-Competition "Liberty Mutual Group: Property Inspection Prediction"☆50Updated 9 years ago
- Just some of my kaggle scripts☆88Updated 9 years ago
- Python implementation of stacked generalization classifier. Plays nice with sklearn.☆71Updated 8 years ago
- Bayesian Optimization using xgboost and sklearn API☆226Updated 9 years ago
- My best submission to the Kaggle competition "Predicting a Biological Response", ranked 17th over 711 teams.☆440Updated 8 years ago
- An implementation of Caruana et al's Ensemble Selection algorithm in Python, based on scikit-learn☆151Updated 4 years ago
- Amazon Employee Access Challenge☆207Updated 11 years ago
- my public kaggle code☆79Updated 11 years ago
- Kaggle's Allstate Purchase Prediction Challenge☆88Updated 7 years ago
- Finding document vectors from pre-trained word2vec word vectors☆115Updated 9 years ago
- Multi-core implementation of Regularized Greedy Forest☆464Updated 6 years ago
- Hashed Factorization Machine with Follow The Regularized Leader for Kaggle Avazu Click-Through Rate Competition☆260Updated 8 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆166Updated 8 years ago