skrub-data / skrub
Machine learning with dataframes
☆1,459 · Updated this week
Alternatives and similar repositories for skrub
Users who are interested in skrub compare it to the libraries listed below.
- Extra blocks for scikit-learn pipelines. ☆1,360 · Updated 2 weeks ago
- Own Your Data Science. Skore's open-source Python library accelerates ML model development with automated evaluation re… ☆529 · Updated last week
- Feature engineering package with sklearn like functionality ☆2,132 · Updated 3 weeks ago
- Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible). ☆1,500 · Updated last month
- Natural Intelligence is still a pretty good idea. ☆821 · Updated last year
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions. ☆1,464 · Updated last week
- Fast SHAP value computation for interpreting tree-based models ☆542 · Updated 2 years ago
- A scikit-learn-compatible module for comparing imputation methods. ☆141 · Updated last month
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ☆703 · Updated last week
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad… ☆667 · Updated 7 months ago
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆499 · Updated this week
- EvalML is an AutoML library written in python. ☆829 · Updated 2 weeks ago
- A set of data tools in Python ☆505 · Updated this week
- Predictive Power Score (PPS) in Python ☆1,158 · Updated 8 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆310 · Updated 5 months ago
- Data Quality assessment with one line of code ☆450 · Updated 3 weeks ago
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,456 · Updated last week
- Doubt your data, find bad labels. ☆514 · Updated last year
- Monitor the stability of a Pandas or Spark dataframe ☆505 · Updated 3 weeks ago
- machine learning with logical rules in Python ☆645 · Updated last year
- Statistical package in Python based on Pandas ☆1,829 · Updated last month
- Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA, FAMD, GPA ☆1,400 · Updated last month
- A Python package for causal inference in quasi-experimental settings ☆1,037 · Updated this week
- Temporian is an open-source Python library for preprocessing and feature engineering temporal data for machine learning applicati… ☆703 · Updated last year
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values. ☆634 · Updated last year
- A drop-in replacement for Scikit-Learn's GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques. ☆468 · Updated last year
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console. ☆469 · Updated last week
- Human-explainable AI. ☆526 · Updated 2 weeks ago
- Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models ☆2,953 · Updated 2 weeks ago
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data. ☆1,137 · Updated last week