MiyainNYC / Distributed-Machine-Learning
PySpark, Databricks, H2O, MLlib
☆18Updated 8 years ago
Alternatives and similar repositories for Distributed-Machine-Learning:
Users who are interested in Distributed-Machine-Learning are comparing it to the repositories listed below.
- Recommender Systems, Social Network Analysis, static & dynamic graph modeling, Neo4j, igraph, networkX☆8Updated 7 years ago
- Code to 1) scrape Wikipedia page view counts, and 2) conduct time series analysis with GAM☆47Updated 7 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Hierarchical Clustering Algorithms☆35Updated 2 years ago
- Forecasting Uber demand in NYC neighborhoods☆34Updated 6 years ago
- ☆39Updated 7 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆60Updated last year
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago
- Interactive dashboard that shows a decision support system to help DYCD/DOE award RFPs for the 2015 SONYC expansion.☆38Updated 2 years ago
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Practical Time Series Analysis (V), published by Packt☆12Updated 4 years ago
- In this Facebook Live code-along session with Hugo Bowne-Anderson, you're going to check out Google Trends data of keywords 'diet', 'gym'…☆44Updated 7 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- ☆15Updated 10 years ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- CentOS based Docker container for Time Series Analysis and Modeling.☆21Updated 5 years ago
- Project template for highly effective data science workflows☆29Updated 9 months ago
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaign☆27Updated 2 years ago
- A machine learning algorithm written to predict the severity of insurance claims☆19Updated 8 years ago
- ☆13Updated 5 years ago
- Jupyter Notebooks for supplychainpy☆22Updated 7 years ago
- Machine learning and process automation☆136Updated 2 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 7 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- ☆11Updated 6 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Structural Time Series on US electricity demand data☆22Updated 4 years ago
- Code repository supporting the Medium blog☆13Updated 4 years ago