MiyainNYC / Distributed-Machine-Learning
PySpark, Databrick, h2o, MLlib
☆18Updated 8 years ago
Alternatives and similar repositories for Distributed-Machine-Learning
Users that are interested in Distributed-Machine-Learning are comparing it to the libraries listed below
Sorting:
- Recommender Systems, Social Network Analysis, static & dynamic graph modeling, Neo4j, igraph, networkX☆9Updated 7 years ago
- ☆14Updated 5 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- ☆17Updated 4 years ago
- Jupyter Notebooks for supplychainpy☆22Updated 8 years ago
- Investigate how mutual funds leverage credit derivatives by studying their routine filings to the SEC using NLP techniques 📈🤑☆51Updated 4 months ago
- A public available dataset for using market sentiment for financial asset allocation.☆23Updated 6 years ago
- Statistical Methods & Applied Mathematics in Data Science, Published by Packt☆27Updated 4 years ago
- My work on UCSD CSE 250B Principles of Artificial Intelligence: Learning Algorithms☆13Updated 5 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications☆22Updated 2 years ago
- A machine learning algorithm written to predict severity of insurance claim☆20Updated 8 years ago
- Presentation + Jupyter Notebook from PyGotham July 2016☆35Updated last year
- CentOS based Docker container for Time Series Analysis and Modeling.☆21Updated 5 years ago
- Code repository supporting the medium blog☆13Updated 5 years ago
- Sample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/☆27Updated 10 months ago
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago
- A tutorial to create python based prediction web app☆30Updated 5 years ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series☆67Updated 7 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- ☆15Updated 10 years ago
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…☆27Updated last year
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 4 years ago
- Deep learning for time-varying multi-entity datasets☆17Updated 7 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 7 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms☆52Updated 4 years ago
- ☆15Updated 4 years ago
- ☆10Updated 8 years ago
- Azure DP-100 Data Scientist Study Guide☆9Updated 4 years ago