dirty-data-science / pythonLinks
Tutorial material on machine learning with dirty data in Python
☆60Updated 10 months ago
Alternatives and similar repositories for python
Users that are interested in python are comparing it to the libraries listed below
Sorting:
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- General Interpretability Package☆58Updated 2 years ago
- A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using th…☆115Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Data Analysis Baseline Library☆132Updated 7 months ago
- data⎰describe: Pythonic EDA Accelerator for Data Science☆301Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- One day workshop for machine learning with scikit-learn☆63Updated last year
- bayes-toolbox☆92Updated this week
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- An introduction to Bayesian statistics using Python and (coming soon) R.☆133Updated last year
- Phi_K correlation analyzer library☆164Updated 4 months ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆107Updated 3 years ago
- Missing data amputation and exploration functions for Python☆70Updated 2 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Measure and visualize machine learning model performance without the usual boilerplate.☆97Updated 8 months ago
- ☆134Updated last year
- Templates for jupyter notebooks☆145Updated last year
- python library for automated dataset normalization☆115Updated last year
- Train multi-task image, text, or ensemble (image + text) models☆45Updated last year
- In which I put together my thoughts on the practice of data science.☆295Updated last year
- Clustering for mixed-type data☆99Updated 10 months ago
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆105Updated 6 years ago
- Generalized additive models in Python with a Bayesian twist☆78Updated last year
- Automated machine learning: Review of the state-of-the-art and opportunities for healthcare☆41Updated 4 years ago
- Multivariate Boosted TRee☆63Updated 2 years ago
- Wrap-up to automatically tune xgboost in Python.☆80Updated 3 years ago
- A drag-and-drop dashboard editor for JupyterLab☆219Updated 2 years ago
- Decorators that logs stats.☆112Updated 2 months ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆41Updated last year