dirty-data-science / python
Tutorial material on machine learning with dirty data in Python
☆62Updated 7 months ago
Alternatives and similar repositories for python:
Users that are interested in python are comparing it to the libraries listed below
- data⎰describe: Pythonic EDA Accelerator for Data Science☆299Updated last year
- General Interpretability Package☆58Updated 2 years ago
- Data Analysis Baseline Library☆130Updated 3 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- bayes-toolbox☆93Updated 2 months ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 2 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆113Updated 2 years ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- A Python library for time series forecasting☆82Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆83Updated last year
- python library for automated dataset normalization☆113Updated last year
- In which I put together my thoughts on the practice of data science.☆292Updated last year
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆106Updated 5 years ago
- PyData London 2022 Tutorial☆66Updated 2 years ago
- Clustergram - Visualization and diagnostics for cluster analysis in Python☆123Updated last month
- How to Interpret SHAP Analyses: A Non-Technical Guide☆51Updated 3 years ago
- scikit-learn compatible tools for building credit risk acceptance models☆94Updated last week
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- Time should be taken seer-iously☆313Updated last year
- Repo for the ML_Insights python package☆149Updated last year
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆42Updated last year
- pymc-learn: Practical probabilistic machine learning in Python☆226Updated 4 years ago
- In which I play with the ideas surrounding causality☆51Updated 2 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- ☆231Updated 3 years ago
- sidetable builds simple but useful summary tables of your data☆387Updated 2 years ago
- How to use SHAP values for better cluster analysis☆55Updated 2 years ago
- Hierarchical Time Series Forecasting with a familiar API☆224Updated last year