hanhanwu / Hanhan_Data_Science_Practice
data analysis, big data development, cloud, and any other cool things!
☆30Updated 7 months ago
Alternatives and similar repositories for Hanhan_Data_Science_Practice:
Users that are interested in Hanhan_Data_Science_Practice are comparing it to the libraries listed below
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- ☆27Updated 6 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Notebook and slides for my talk at Pydata NYC 2018☆88Updated 9 months ago
- Default risk prediction for Home Credit competition - Fast, scalable and maintainable SQL-based feature engineering pipeline☆79Updated 6 years ago
- ☆100Updated 6 years ago
- Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop☆92Updated 2 years ago
- Demand Forecasting Models for Kaggle competition☆81Updated 6 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆25Updated 5 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner)☆20Updated 7 years ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series☆67Updated 7 years ago
- Slides and materials for most of my talks by year☆92Updated last year
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- Personal repository of data science demonstrations and references☆75Updated 2 years ago
- This repository contains Time series Analysis and Forecasting tutorial from Analytics Vidhya☆22Updated 6 years ago
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- ☆33Updated 2 years ago
- This repository contains a notebook demonstrating a practical implementation of the so-called Entity Embedding for Encoding Categorical F…☆74Updated 6 years ago
- ☆51Updated 6 years ago
- Some small utility modules to help with pandas, numpy and sklearn usage in other projects☆22Updated 9 years ago
- ☆22Updated last year
- Club Mahindra DataOlympics 03-05-2019 - 05-05-2019☆21Updated 5 years ago
- Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data explorat…☆30Updated 5 years ago
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆105Updated 6 years ago
- Applying automated feature engineering to the Kaggle Home Credit Default Risk Competition☆18Updated 6 years ago
- Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆42Updated 8 years ago
- Guide on creating an API for serving your ML model☆65Updated 2 years ago
- Curated set of transformers that make your work with steppy faster and more effective☆22Updated 6 years ago
- a python based module (bot) to generate kaggle baseline kernels☆26Updated 6 years ago
- ☆45Updated 4 years ago