hanhanwu / Hanhan_Data_Science_PracticeLinks
data analysis, big data development, cloud, and any other cool things!
☆31Updated last year
Alternatives and similar repositories for Hanhan_Data_Science_Practice
Users that are interested in Hanhan_Data_Science_Practice are comparing it to the libraries listed below
Sorting:
- Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data explorat…☆30Updated 6 years ago
- Notebook and slides for my talk at Pydata NYC 2018☆88Updated last year
- ☆98Updated 7 years ago
- Forecasting Uber demand in NYC neighborhoods☆34Updated 7 years ago
- Demand Forecasting Models for Kaggle competition☆87Updated 7 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated 2 years ago
- Personal repository of data science demonstrations and references☆78Updated 3 years ago
- ☆101Updated 7 years ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series☆68Updated 7 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 8 years ago
- Quick Implementation in python☆53Updated 6 years ago
- ☆155Updated 5 years ago
- Tips for Advanced Feature Engineering☆53Updated 5 years ago
- ☆136Updated 7 years ago
- No Regrets: A deep dive comparison of bandits and A/B testing☆47Updated 7 years ago
- ☆46Updated 4 years ago
- Learning statistics with Python☆53Updated 4 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆49Updated 7 years ago
- Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop☆91Updated 2 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆26Updated 6 years ago
- Experimenting with and teaching probabilistic programming☆107Updated 3 years ago
- Implementation OF KMEans, KMode, Kprototype and Agllomerative Hierarchical Clustering Using Python.☆35Updated 7 years ago
- The art of effective visualization of multi-dimensional data☆164Updated 7 years ago
- Some work on Kaggle data for fun☆64Updated 8 years ago
- This repository contains a notebook demonstrating a practical implementation of the so-called Entity Embedding for Encoding Categorical F…☆74Updated 6 years ago
- ☆103Updated 2 years ago
- Materials for an online-course - "Practical XGBoost in Python"☆220Updated 9 years ago
- This repository contains Time series Analysis and Forecasting tutorial from Analytics Vidhya☆22Updated 7 years ago
- Material for the Big Data Analytics exercise classes - INFOH515 - Big Data : Distributed Data Management and Scalable Analytics - Univers…☆61Updated 3 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 8 years ago