hanhanwu / Hanhan_Data_Science_Resources
helpful resources for (big) data science
☆33Updated 3 years ago
Alternatives and similar repositories for Hanhan_Data_Science_Resources:
Users that are interested in Hanhan_Data_Science_Resources are comparing it to the libraries listed below
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- pyspark sample scripts☆17Updated 6 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 9 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆165Updated 6 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆149Updated 7 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- ☆11Updated 6 years ago
- ☆102Updated 6 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- Content for the Model Interpretability Tutorial at Pycon US 2019☆41Updated 8 months ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- ☆77Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research.☆12Updated 8 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 5 months ago
- Slides and code examples for H2O tutorials at various events☆56Updated 7 years ago
- Collection of presentation of my work on various platforms and meetups☆22Updated 6 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Updated 9 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- ☆26Updated last year
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Set of Machine Learning and Stochastic Optimazion tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/☆177Updated last year
- 57th place solution in "Bosch Production Line Performance"☆19Updated 7 years ago
- A compiled list of kaggle competitions and their winning solutions for sequence problems.☆35Updated 8 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago