dirty-cat / dirty_cat
Machine learning on dirty tabular data (legacy clone of skrub)
☆20 Updated 9 months ago
Alternatives and similar repositories for dirty_cat
Users who are interested in dirty_cat are comparing it to the libraries listed below.
- IbisML is a library for building scalable ML pipelines using Ibis. ☆119 Updated 4 months ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information. ☆156 Updated 2 months ago
- implementation of Cyclic Boosting machine learning algorithms ☆94 Updated last year
- Rethinking machine learning pipelines ☆34 Updated 2 months ago
- Competing Risks and Survival Analysis ☆112 Updated 2 months ago
- Turn SciKitLearn pipelines into SQL ☆106 Updated 2 weeks ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated last month
- ☆41 Updated last year
- Time based splits for cross validation ☆39 Updated 2 weeks ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆56 Updated 5 months ago
- A more flexible alternative to scikit-learn Pipelines ☆38 Updated 6 months ago
- DataFrame support for scikit-learn. ☆63 Updated 3 months ago
- Pipeline components that support partial_fit. ☆46 Updated last year
- An abstraction layer for parameter tuning ☆35 Updated last month
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code! ☆139 Updated this week
- Buy Till You Die and Customer Lifetime Value statistical models in Python. ☆117 Updated last year
- Tutorial for implementing data validation in data science pipelines ☆33 Updated 3 years ago
- Tools for diagnostics and assessment of (machine learning) models ☆39 Updated last month
- Python package implementing ML feature engineering and pre-processing for polars or pandas dataframes. ☆80 Updated this week
- Parallel processing on pandas with progress bars ☆61 Updated 5 months ago
- Polars plugin for pairwise distance functions ☆92 Updated 7 months ago
- Polars Time Series Extension ☆32 Updated last month
- A FastMCP tool to search and retrieve Polars API documentation. ☆71 Updated 6 months ago
- Prune your sklearn models ☆19 Updated last year
- SciKIt-learn Pipeline in PAndas ☆42 Updated 2 years ago
- A Pythonic microframework for multi-armed bandit problems ☆133 Updated this week
- Phi_K correlation analyzer library ☆169 Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆93 Updated last year
- Code and data for the Modern Polars book ☆230 Updated last year
- DSL for HTML that targets marimo and more! ☆67 Updated 5 months ago