MiyainNYC / Distributed-Machine-LearningLinks
PySpark, Databrick, h2o, MLlib
☆19Updated 9 years ago
Alternatives and similar repositories for Distributed-Machine-Learning
Users that are interested in Distributed-Machine-Learning are comparing it to the libraries listed below
Sorting:
- Jupyter Notebooks for supplychainpy☆23Updated 8 years ago
- In this Facebook live code along session with Hugo Bowne-Anderson, you're going to check out Google trends data of keywords 'diet', 'gym'…☆44Updated 7 years ago
- Work for Mastering Large Datasets with Python☆20Updated 2 years ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Updated 6 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆64Updated 2 years ago
- ☆39Updated 8 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆53Updated 5 years ago
- ☆26Updated 7 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 8 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- Project template for highly effective data science workflows☆29Updated last year
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 5 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- ☆19Updated 4 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Data Science analysis and visualization using Python.☆18Updated 8 years ago
- ☆26Updated 5 years ago
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…☆27Updated last year
- ☆14Updated 6 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆25Updated 6 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms☆52Updated 5 years ago
- BASM - 2017 Spring☆26Updated 7 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications☆22Updated 2 years ago
- Various methods for generating synthetic data for data science and ML☆80Updated 4 years ago
- ☆101Updated 7 years ago
- Tips for Advanced Feature Engineering☆53Updated 5 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆103Updated last week
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆27Updated 5 years ago