jingmin1987 / variable-clustering
A re-creation of SAS varclus procedure in Python
☆23Updated 6 years ago
Alternatives and similar repositories for variable-clustering:
Users that are interested in variable-clustering are comparing it to the libraries listed below
- credit risk score card develop by python(version 3.6)☆41Updated 7 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Exploratory Data Analysis, Dealing with Missing Values, Data Munging, Ensembled Regression Model using Stacked Regressor, XGBoost and mic…☆22Updated 7 years ago
- ☆136Updated 6 years ago
- Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in t…☆64Updated 4 years ago
- Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard …☆117Updated 2 years ago
- About uplift modeling☆30Updated 8 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Updated 7 years ago
- A project on machine learning techniques dealing with imbalanced classification (Python)☆11Updated 7 years ago
- Using Imblearn To Tackle Imbalanced Data Sets☆37Updated 8 years ago
- Project work for Udacity's AB Testing Course☆82Updated 7 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- Score card model for Credit Scoring System.☆114Updated 5 years ago
- Gradient boosting model for predicting credit default risk on Kaggle competition☆16Updated 4 years ago
- You work for a consumer finance company which specializes in lending various types of loans to urban customers. When the company receives…☆13Updated 3 years ago
- This is the behavior scorecard, which includes three modules, including data processing, establishment of score card and effect evaluatio…☆19Updated 5 years ago
- Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop☆92Updated 2 years ago
- scikit-learn compatible tools for building credit risk acceptance models☆95Updated last month
- Examples of how to do feature engineering and Xgboost parameter tuning☆46Updated 8 years ago
- Quick Implementation in python☆52Updated 5 years ago
- Jupyter Notebook used for writing the article "Black-Box models are actually more explainable than a Logistic Regression" published in To…☆72Updated 2 years ago
- A binary classification model is developed to predict the probability of paying back a loan by an applicant. Customer previous loan journ…☆21Updated 2 years ago
- ☆49Updated 5 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆25Updated 5 years ago
- A Python package for variable clustering☆50Updated 4 years ago
- Solution to Corporación Favorita Grocery Sales Forecasting Competition☆28Updated 7 years ago
- Ensemble of ARIMA, prophet and LSTMS RNN☆35Updated 7 years ago
- 分别基于statsmodels和scikit-learn实现两种可用于sklearn pipeline的 LogisticRegression,并输出相应的报告☆20Updated last year
- Updated 7 years ago
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆35Updated 7 years ago