MiyainNYC / Distributed-Machine-LearningLinks
PySpark, Databrick, h2o, MLlib
☆19Updated 9 years ago
Alternatives and similar repositories for Distributed-Machine-Learning
Users that are interested in Distributed-Machine-Learning are comparing it to the libraries listed below
Sorting:
- Jupyter Notebooks for supplychainpy☆23Updated 8 years ago
- Work for Mastering Large Datasets with Python☆20Updated 2 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 8 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆48Updated 7 years ago
- ☆19Updated 4 years ago
- Slides and materials for most of my talks by year☆92Updated 2 years ago
- ☆26Updated 7 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 8 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆53Updated 5 years ago
- ☆14Updated 6 years ago
- Dynamic pricing for selling perishable goods☆64Updated 7 years ago
- This is a repo for all the tutorials put out by H2O.ai. This includes learning paths for Driverless AI, H2O-3, Sparkling Water and more..…☆134Updated last year
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 5 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆63Updated 2 years ago
- Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/☆27Updated last year
- ☆39Updated 8 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- Repository for the PyData DC 2016 tutorial☆29Updated 8 years ago
- Walkthrough notebooks for Deep Learning, Machine Learning, Reinforcement Learning, Spark, Statistics, Algorithms, Scala, Python☆70Updated 2 years ago
- Hypothesis and statistical testing in Python☆65Updated 5 years ago
- Presentation + Jupyter Notebook from PyGotham July 2016☆35Updated 2 years ago
- Reddit Data Science Project Ideas☆10Updated 5 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆25Updated 6 years ago
- ☆26Updated 5 years ago
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…☆27Updated last year
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆85Updated 2 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 5 years ago