dirty-data-science / python
Tutorial material on machine learning with dirty data in Python
☆61Updated 8 months ago
Alternatives and similar repositories for python:
Users that are interested in python are comparing it to the libraries listed below
- data⎰describe: Pythonic EDA Accelerator for Data Science☆299Updated 2 years ago
- Missing data amputation and exploration functions for Python☆67Updated 2 years ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 2 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆115Updated 2 years ago
- PHOTONAI is a high level python API for designing and optimizing machine learning pipelines.☆77Updated 2 months ago
- General Interpretability Package☆58Updated 2 years ago
- Data Analysis Baseline Library☆131Updated 5 months ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated last year
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆105Updated last year
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆83Updated last year
- bayes-toolbox☆93Updated 3 months ago
- One day workshop for machine learning with scikit-learn☆63Updated last year
- Better `keras` models for time series and beyond☆61Updated last year
- ☆230Updated 3 years ago
- A Python library for time series forecasting☆82Updated 2 years ago
- Measure and visualize machine learning model performance without the usual boilerplate.☆97Updated 6 months ago
- The simplest way to deploy a machine learning model☆23Updated 2 years ago
- Advanced random forest methods in Python☆57Updated last year
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆105Updated 6 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- python library for automated dataset normalization☆113Updated last year
- This repo has moved to https://github.com/INRIA/scikit-learn-mooc/☆42Updated 4 years ago
- Generalized additive models in Python with a Bayesian twist☆77Updated 9 months ago
- Clusteval provides methods for unsupervised cluster validation☆58Updated 3 weeks ago
- An introduction to Bayesian statistics using Python and (coming soon) R.☆130Updated last year
- Gradient Boosted Trees + Bayesian Optimization☆25Updated 3 years ago
- Train multi-task image, text, or ensemble (image + text) models☆45Updated last year