rezazadeh / sparkLinks
Mirror of Apache Spark
☆24Updated 10 years ago
Alternatives and similar repositories for spark
Users that are interested in spark are comparing it to the libraries listed below
Sorting:
- ☆28Updated 9 years ago
- National Data Science Bowl☆20Updated 10 years ago
- Using stochastic gradient descent (SGD) with explicit and implicit updates to fit large-scale statistical models.☆16Updated 11 years ago
- Collaborative filtering with the GP-LVM☆25Updated 10 years ago
- ☆36Updated 10 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Updated 8 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20Updated 7 years ago
- Tutorial introducing Monte Carlo integration and Markov Chain Monte Carlo☆52Updated 12 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Updated 9 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting☆18Updated 9 years ago
- A quick educational implementation of a random forest classifier and a decsion jungle classifier.☆28Updated 10 years ago
- Recommender systems in Python☆50Updated 10 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Sparse Beta-Divergence Tensor Factorization Library☆48Updated 5 months ago
- Topological Anomaly Detection (TAD) per Gartley and Basener 2009☆68Updated 5 years ago
- Creates models to classify documents into categories☆66Updated 8 years ago
- Ordered Weighted L1 regularization for classification and regression in Python☆52Updated 7 years ago
- an active learning framework in python☆45Updated 9 years ago
- ☆58Updated 9 years ago
- deep inverse regression☆31Updated 10 years ago
- The information sieve for discrete variables.☆36Updated 9 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 8 years ago
- My best submission to the Kaggle competition "Online Product Sales", ranked 21th over 366 teams.☆30Updated 13 years ago
- FTRL-Proximal Online Learning Algorithm☆15Updated 8 years ago
- Fast, principled L1-regularized loss minimization☆24Updated last year
- Fast k-Nearest Neighbors Classifier for Large Datasets☆69Updated 8 years ago
- ☆25Updated 9 years ago
- MLSS 2016 material.☆22Updated 9 years ago
- Scalable inference for Correlated Topic Models☆31Updated 10 years ago