MiyainNYC / Distributed-Machine-Learning
PySpark, Databrick, h2o, MLlib
☆18Updated 8 years ago
Alternatives and similar repositories for Distributed-Machine-Learning:
Users that are interested in Distributed-Machine-Learning are comparing it to the libraries listed below
- Recommender Systems, Social Network Analysis, static & dynamic graph modeling, Neo4j, igraph, networkX☆9Updated 7 years ago
- In this Facebook live code along session with Hugo Bowne-Anderson, you're going to check out Google trends data of keywords 'diet', 'gym'…☆44Updated 7 years ago
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Jupyter Notebooks for supplychainpy☆22Updated 7 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago
- Teaching materials for the text analytics course☆19Updated 6 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 7 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago
- A machine learning algorithm written to predict severity of insurance claim☆19Updated 8 years ago
- ☆19Updated 3 years ago
- Few tutorials on pandas, matplotlib and seaborn☆26Updated 8 years ago
- Reddit Data Science Project Ideas☆9Updated 5 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- CentOS based Docker container for Time Series Analysis and Modeling.☆21Updated 5 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆45Updated 6 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 7 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Hierarchical Clustering Algorithms☆35Updated 2 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Explore tips and tricks to deploy machine learning models with Docker.☆13Updated last year
- Forecasting Uber demand in NYC neighborhoods☆34Updated 6 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Introduction to Pandas, Scikit-Learn and Keras☆13Updated 5 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- ☆25Updated 7 years ago
- Code Snippets & DataSets for Business Analytics & Data Mining/ Machine Learning Algorithms☆16Updated 6 years ago