MiyainNYC / Distributed-Machine-Learning
PySpark, Databricks, H2O, MLlib
☆19 Updated 8 years ago
Alternatives and similar repositories for Distributed-Machine-Learning
Users who are interested in Distributed-Machine-Learning are comparing it to the repositories listed below.
- Jupyter Notebooks for supplychainpy ☆22 Updated 8 years ago
- Code to 1) scrape Wikipedia page view counts, and 2) conduct time series analysis with GAM ☆47 Updated 7 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel ☆13 Updated 8 years ago
- Pre-modelling analysis of the data through exploratory data analysis and statistical tests. ☆51 Updated last year
- Recommender Systems, Social Network Analysis, static & dynamic graph modeling, Neo4j, igraph, networkX ☆9 Updated 7 years ago
- Examples of implementations of WTTE-RNN ☆32 Updated 7 years ago
- Work for Mastering Large Datasets with Python ☆19 Updated 2 years ago
- Predict whether a loan will be repaid using automated feature engineering. ☆63 Updated last year
- Presentation + Jupyter Notebook from PyGotham July 2016 ☆35 Updated last year
- ☆26 Updated 7 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 8 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering ☆62 Updated 5 years ago
- Survival Analysis with non-parametric, semi-parametric, and parametric models ☆39 Updated 7 years ago
- Examples of how MLJAR can be used ☆60 Updated last year
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder. ☆23 Updated 4 years ago
- Structural Time Series on US electricity demand data ☆22 Updated 4 years ago
- Data Analysis and Machine Learning with Python: EDA with ECDF and Correlation analysis, Preprocessing and Feature engineering, L1 (Lasso)… ☆33 Updated 7 years ago
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments… ☆27 Updated last year
- Tips for Advanced Feature Engineering ☆52 Updated 4 years ago
- This is a repo for all the tutorials put out by H2O.ai. This includes learning paths for Driverless AI, H2O-3, Sparkling Water and more..… ☆134 Updated 11 months ago
- Hypothesis and statistical testing in Python ☆64 Updated 4 years ago
- Content associated with a PyData Seattle 2017 tutorial on Unevenly spaced time series analysis of The Simpsons using pandas ☆15 Updated 8 years ago
- Customer lifetime value analysis (CLV analysis). Uses the Gamma-Gamma model to estimate the average transaction value for each customer. ☆47 Updated 7 years ago
- Various methods for generating synthetic data for data science and ML ☆80 Updated 3 years ago
- Smart, automatic detection and stationarization of non-stationary time series data. ☆29 Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆52 Updated 4 years ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series ☆68 Updated 7 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆61 Updated 2 years ago
- ☆14 Updated 6 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering. ☆23 Updated 5 years ago