dirty-data-science / python
Tutorial material on machine learning with dirty data in Python
☆60Updated 10 months ago
Alternatives and similar repositories for python:
Users that are interested in python are comparing it to the libraries listed below
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆107Updated 3 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆115Updated 2 years ago
- bayes-toolbox☆92Updated 5 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- One day workshop for machine learning with scikit-learn☆63Updated last year
- Missing data amputation and exploration functions for Python☆68Updated 2 years ago
- Data Analysis Baseline Library☆132Updated 6 months ago
- data⎰describe: Pythonic EDA Accelerator for Data Science☆300Updated 2 years ago
- Logistic regression with bound and linear constraints. L1, L2 and Elastic-Net regularization.☆33Updated 2 years ago
- Phi_K correlation analyzer library☆164Updated 3 months ago
- A python package for time series forecasting with scikit-learn estimators.☆161Updated last year
- General Interpretability Package☆58Updated 2 years ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- pymc-learn: Practical probabilistic machine learning in Python☆228Updated 4 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- A Python library for time series forecasting☆81Updated 2 years ago
- PyData London 2022 Tutorial☆66Updated 2 years ago
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- An introduction to Bayesian statistics using Python and (coming soon) R.☆131Updated last year
- This repo has moved to https://github.com/INRIA/scikit-learn-mooc/☆42Updated 4 years ago
- Advanced random forest methods in Python☆57Updated last year
- The simplest way to deploy a machine learning model☆23Updated 2 years ago
- A curated list of Python libraries used for data science.☆89Updated 10 months ago
- Generalized additive models in Python with a Bayesian twist☆77Updated 11 months ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆52Updated 4 years ago
- Tutorial on time-series forcasting with scikit-learn☆33Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated last year
- ☆17Updated 3 years ago