dirty-data-science / python
Tutorial material on machine learning with dirty data in Python
☆62Updated 2 months ago
Related projects: ⓘ
- One day workshop for machine learning with scikit-learn☆63Updated last year
- bayes-toolbox☆93Updated 9 months ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 2 years ago
- ☆127Updated this week
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆105Updated last year
- General Interpretability Package☆58Updated last year
- How to Interpret SHAP Analyses: A Non-Technical Guide☆42Updated 2 years ago
- Missing data amputation and exploration functions for Python☆64Updated last year
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆54Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated last year
- Data Analysis Baseline Library☆130Updated 8 months ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆112Updated last year
- This repo has moved to https://github.com/INRIA/scikit-learn-mooc/☆41Updated 4 years ago
- Phi_K correlation analyzer library☆155Updated last week
- Generalized additive models in Python with a Bayesian twist☆76Updated 3 months ago
- Measure and visualize machine learning model performance without the usual boilerplate.☆94Updated this week
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆75Updated last year
- data⎰describe: Pythonic EDA Accelerator for Data Science☆295Updated last year
- Python port of "Common statistical tests are linear models" by Jonas Kristoffer Lindeløv.☆87Updated 3 weeks ago
- Train multi-task image, text, or ensemble (image + text) models☆45Updated last year
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆50Updated last year
- ☆67Updated this week
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆64Updated 4 months ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆41Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆82Updated 8 months ago
- Clusteval provides methods for unsupervised cluster validation☆55Updated 10 months ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆116Updated last year
- ☆18Updated 9 months ago
- A Python library for time series forecasting☆82Updated last year
- Exploratory repository to study predictive survival analysis models☆30Updated last year