jingmin1987 / variable-clustering
A re-creation of SAS varclus procedure in Python
☆23Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for variable-clustering
- Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard …☆117Updated 2 years ago
- credit risk score card develop by python(version 3.6)☆41Updated 6 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Updated 6 years ago
- About uplift modeling☆30Updated 8 years ago
- A project on machine learning techniques dealing with imbalanced classification (Python)☆10Updated 7 years ago
- ☆137Updated 5 years ago
- Using Imblearn To Tackle Imbalanced Data Sets☆37Updated 8 years ago
- Exploratory Data Analysis, Dealing with Missing Values, Data Munging, Ensembled Regression Model using Stacked Regressor, XGBoost and mic…☆22Updated 7 years ago
- scikit-learn compatible tools for building credit risk acceptance models☆85Updated 3 months ago
- ☆51Updated 6 years ago
- This is the behavior scorecard, which includes three modules, including data processing, establishment of score card and effect evaluatio…☆19Updated 5 years ago
- Cognitive Computing Final Project☆7Updated 5 years ago
- Kaggle home credit default risk competition☆53Updated 6 years ago
- Examples of how to do feature engineering and Xgboost parameter tuning☆46Updated 8 years ago
- kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Updated 7 years ago
- Address imbalance classes in machine learning projects.☆66Updated 6 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆34Updated 7 years ago
- Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in t…☆63Updated 4 years ago
- 分别基于statsmodels和scikit-learn实现两种可用于sklearn pipeline的 LogisticRegression,并输出相应的报告☆19Updated last year
- A Python package for variable clustering☆47Updated 3 years ago
- Gradient boosting model for predicting credit default risk on Kaggle competition☆16Updated 3 years ago
- ☆59Updated 5 years ago
- Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop☆92Updated last year
- Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data explorat…☆30Updated 4 years ago
- A python package for feature selection in python☆50Updated 3 years ago
- You work for a consumer finance company which specializes in lending various types of loans to urban customers. When the company receives…☆13Updated 3 years ago
- Credit Risk analysis by using Python and ML☆151Updated 7 years ago
- Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical va…☆14Updated 10 months ago