dirty-data-science / pythonLinks
Tutorial material on machine learning with dirty data in Python
☆61Updated last year
Alternatives and similar repositories for python
Users that are interested in python are comparing it to the libraries listed below
Sorting:
- data⎰describe: Pythonic EDA Accelerator for Data Science☆302Updated 2 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆116Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- Data Analysis Baseline Library☆133Updated 11 months ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆108Updated 3 years ago
- ☆133Updated last year
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- Missing data amputation and exploration functions for Python☆72Updated 2 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- Python package for publishing Jupyter Notebooks as Medium blogposts☆148Updated 2 years ago
- General Interpretability Package☆58Updated 2 years ago
- Phi_K correlation analyzer library☆167Updated this week
- In which I put together my thoughts on the practice of data science.☆302Updated 2 years ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆41Updated 2 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)☆206Updated 3 years ago
- Creates dynamic html report from jupyter notebook.☆329Updated 10 months ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆53Updated 5 years ago
- Measure and visualize machine learning model performance without the usual boilerplate.☆99Updated last year
- 📈 The panel-highcharts package makes it easy to use HighCharts in Python, Notebooks and with HoloViz Panel.☆159Updated 2 years ago
- Better heatmaps in Python☆136Updated 3 years ago
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab☆225Updated 5 years ago
- sidetable builds simple but useful summary tables of your data☆393Updated 2 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- This is repo to demonstrate how to convert from Jupyter Notebook to scripts with some engineering practices☆84Updated 2 years ago
- Toolkit for developing and maintaining ML models☆154Updated last year
- Templates for jupyter notebooks☆147Updated last year
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.☆104Updated 3 years ago