rakutentech / spark-dirty-cat
Similarity encoding of dirty categorical variables (strings)
โ20Updated 5 years ago
Related projects โ
Alternatives and complementary repositories for spark-dirty-cat
- ๐งฎ Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.โ41Updated 2 years ago
- Pipeline components that support partial_fit.โ43Updated 4 months ago
- ๐ช Bayesian Hierarchical Models at Scaleโ51Updated 3 years ago
- this repo might get acceptedโ29Updated 3 years ago
- Cyclic Boosting Machines - an explainable supervised machine learning algorithmโ59Updated 2 months ago
- Python library to explain Tree Ensemble models (TE) like XGBoost, using a rule list.โ44Updated 7 months ago
- Helpers for scikit learnโ16Updated last year
- Evaluation of early stopping algorithms in A/B testingโ16Updated 7 years ago
- Embed categorical variables via neural networks.โ59Updated last year
- In-Session Personalization Workshop for eCommerce, April 2021, and the MICES Workshop in June 2021.โ22Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsโ104Updated last year
- โ28Updated 5 years ago
- Automatic Feature Engineering for Time Seriesโ17Updated last year
- ๐๐ Lets Python do AB testing analysisโ75Updated 7 months ago
- Repository for my master thesis on automated string handlingโ16Updated 3 years ago
- [Intemarchรฉ] Sales forecasting challengeโ11Updated 3 years ago
- Python package for Bayesian Tests / AB Testingโ40Updated 4 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"โ24Updated 5 years ago
- Prune your sklearn modelsโ19Updated 3 weeks ago
- Multiple ways to model user preference in recommender systemsโ14Updated 6 months ago
- Extra functionalities for riverโ14Updated 6 months ago
- Python package for Bayesian & Frequentist A/B Testingโ12Updated last year
- An experiment on explicit vs implicit feedback recommendersโ25Updated 6 years ago
- Record matching and entity resolution at scale in Sparkโ31Updated last year
- My collection of causal inference algorithms built on top of accessible, simple, out-of-the-box ML methods, aimed at being explainable anโฆโ28Updated last year
- A full example for causal inference on real-world retail data, for elasticity estimationโ45Updated 3 years ago
- Python implementation of R package breakDownโ41Updated last year