jingmin1987 / variable-clustering
A re-creation of SAS varclus procedure in Python
☆23Updated 6 years ago
Alternatives and similar repositories for variable-clustering:
Users that are interested in variable-clustering are comparing it to the libraries listed below
- credit risk score card develop by python(version 3.6)☆40Updated 7 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- A project on machine learning techniques dealing with imbalanced classification (Python)☆11Updated 7 years ago
- Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard …☆117Updated 2 years ago
- Score card model for Credit Scoring System.☆114Updated 6 years ago
- Using Imblearn To Tackle Imbalanced Data Sets☆37Updated 8 years ago
- ☆136Updated 6 years ago
- Exploratory Data Analysis, Dealing with Missing Values, Data Munging, Ensembled Regression Model using Stacked Regressor, XGBoost and mic…☆22Updated 7 years ago
- Solution to Corporación Favorita Grocery Sales Forecasting Competition☆28Updated 7 years ago
- ☆51Updated 6 years ago
- About uplift modeling☆30Updated 8 years ago
- This is the behavior scorecard, which includes three modules, including data processing, establishment of score card and effect evaluatio…☆19Updated 5 years ago
- Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop☆92Updated 2 years ago
- Tuning XGBoost hyper-parameters with Simulated Annealing☆52Updated 8 years ago
- 分别基于statsmodels和scikit-learn实现两种可用于sklearn pipeline的 LogisticRegression,并输出相应的报告☆20Updated last year
- ☆60Updated 6 years ago
- Risk scorecard develop tool welcome to use☆17Updated 3 years ago
- Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in t…☆63Updated 4 years ago
- scikit-learn compatible tools for building credit risk acceptance models☆99Updated 2 months ago
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆35Updated 7 years ago
- Codes and dashboards for 4th place solution for Kaggle's Home Credit Default Risk competition☆31Updated 6 years ago
- Gradient boosting model for predicting credit default risk on Kaggle competition☆16Updated 4 years ago
- ☆34Updated 6 years ago
- Jupyter Notebook used for writing the article "Black-Box models are actually more explainable than a Logistic Regression" published in To…☆73Updated 2 years ago
- Examples of how to do feature engineering and Xgboost parameter tuning☆46Updated 8 years ago
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Updated 7 years ago
- kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Updated 7 years ago
- A Python package for variable clustering☆50Updated 4 years ago