rezazadeh / spark
Mirror of Apache Spark
☆24Updated 9 years ago
Alternatives and similar repositories for spark:
Users who are interested in spark are comparing it to the libraries listed below.
- Using stochastic gradient descent (SGD) with explicit and implicit updates to fit large-scale statistical models.☆16Updated 10 years ago
- Gopalan, P., Ruiz, F. J., Ranganath, R., & Blei, D. M. (2014). Bayesian Nonparametric Poisson Factorization for Recommendation Systems. I…☆15Updated 10 years ago
- Large Scale Machine learning Optimization through Stochastic Average Gradient☆9Updated 9 years ago
- Notes and slides from my PyCon Ireland 2016 PyData talk, an introduction to gradient boosting☆18Updated 8 years ago
- National Data Science Bowl☆20Updated 10 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Quick & dirty repo for hosting the Notebook for the t-SNE presentation delivered at Python Quants and PyData London meetups☆9Updated 9 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- ☆25Updated 9 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆26Updated 8 years ago
- Topic analysis using RSM or PVDM.☆11Updated 10 years ago
- Scalable inference for Correlated Topic Models☆30Updated 10 years ago
- MLSS 2016 material.☆22Updated 8 years ago
- Mirror of Apache Spark (With R Frontend on Spark Streaming)☆11Updated 9 years ago
- Tutorial introducing Monte Carlo integration and Markov Chain Monte Carlo☆52Updated 12 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20Updated 6 years ago
- Collaborative filtering with the GP-LVM☆25Updated 9 years ago
- Fast, principled L1-regularized loss minimization☆24Updated last year
- TBEEF, a doubly ensemble framework for recommendation and prediction problems.☆20Updated 9 years ago
- ☆14Updated 9 years ago
- Material for open source machine learning practical☆21Updated 9 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- FTRL-Proximal Online Learning Algorithm☆15Updated 7 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 10 years ago
- A quick educational implementation of a random forest classifier and a decision jungle classifier.☆28Updated 10 years ago
- ☆46Updated 11 years ago
- Run Nx2 Cross Validation for multiple binary classifiers in parallel with optional downsampling☆13Updated 10 years ago
- GSOC 2017 - Apache Organization - Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python…☆14Updated 8 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Various notebooks and tutorials on subjects of interest.☆36Updated 4 years ago