cpa-analytics / embedding-encoder
Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.
☆42Updated last year
Alternatives and similar repositories for embedding-encoder:
Users that are interested in embedding-encoder are comparing it to the libraries listed below
- TabNet for fastai☆123Updated 9 months ago
- An unsupervised feature selection technique using supervised algorithms such as XGBoost☆89Updated last year
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 2 years ago
- Learn Pyro through the M5 forecasting competition☆84Updated 4 years ago
- Small Dataset Benchmarks on the Dataset of Datasets UCI++☆86Updated 2 years ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆65Updated 8 months ago
- Random Forest or XGBoost? It is Time to Explore LCE☆67Updated last year
- Fast implementation of Venn-ABERS probabilistic predictors☆72Updated 11 months ago
- An extension of CatBoost to probabilistic modelling☆142Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- Helpers for scikit learn☆16Updated 2 years ago
- Example usage of scikit-hts☆57Updated 2 years ago
- Probabilistic Gradient Boosting Machines☆144Updated 11 months ago
- Hierarchical Time Series Forecasting with a familiar API☆223Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- Phi_K correlation analyzer library☆157Updated this week
- Clustering for mixed-type data☆96Updated 5 months ago
- hgboost is a python package for hyper-parameter optimization for xgboost, catboost or lightboost using cross-validation, and evaluating t…☆61Updated 3 months ago
- Pipeline components that support partial_fit.☆44Updated 6 months ago
- Advanced random forest methods in Python☆58Updated last year
- Benchmark tabular Deep Learning models against each other and other non-DL techniques☆53Updated 3 years ago
- Modification of TabNet as suggested in the Medium article, "The Unreasonable Ineffectiveness of Deep Learning on Tabular Data"☆61Updated last year
- Train multi-task image, text, or ensemble (image + text) models☆45Updated last year
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Updated 2 years ago
- A presention of core concepts and a data generator making easier using tabular data with TensorFlow and Keras☆41Updated last year
- Time Series package for fastai v2☆94Updated last year
- Batch shap calculations.☆31Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Improved TabNet for TensorFlow☆52Updated 2 years ago