skrub-data / skrub
Machine learning with dataframes
☆1,428 · Updated this week
Alternatives and similar repositories for skrub
Users who are interested in skrub are comparing it to the libraries listed below.
- Extra blocks for scikit-learn pipelines. ☆1,349 · Updated 2 weeks ago
- Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible). ☆1,482 · Updated last week
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions. ☆1,435 · Updated this week
- Natural Intelligence is still a pretty good idea. ☆817 · Updated 11 months ago
- Own Your Data Science. Skore's open-source Python library accelerates ML model development with automated evaluation re… ☆492 · Updated this week
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ☆700 · Updated last month
- Feature engineering package with sklearn like functionality ☆2,089 · Updated this week
- Fast SHAP value computation for interpreting tree-based models ☆538 · Updated 2 years ago
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆491 · Updated 3 weeks ago
- A set of data tools in Python ☆504 · Updated 6 months ago
- EvalML is an AutoML library written in python. ☆816 · Updated this week
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,429 · Updated this week
- Predictive Power Score (PPS) in Python ☆1,152 · Updated 6 months ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad… ☆657 · Updated 4 months ago
- Doubt your data, find bad labels. ☆513 · Updated 11 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆299 · Updated 2 months ago
- A scikit-learn-compatible module for comparing imputation methods. ☆137 · Updated last month
- A drop-in replacement for Scikit-Learn's GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques. ☆467 · Updated last year
- A Python package for causal inference in quasi-experimental settings ☆1,015 · Updated this week
- Monitor the stability of a Pandas or Spark dataframe ☆503 · Updated 5 months ago
- Quickly annotate data from the comfort of your Jupyter notebook ☆786 · Updated last year
- Temporian is an open-source Python library for preprocessing and feature engineering temporal data for machine learning applicati… ☆695 · Updated 11 months ago
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console. ☆464 · Updated last month
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values. ☆618 · Updated last year
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data. ☆1,119 · Updated last year
- Multiple Imputation with LightGBM in Python ☆384 · Updated 11 months ago
- nannyml: post-deployment data science in python ☆2,082 · Updated 2 months ago
- Data Analysis Baseline Library ☆728 · Updated 6 months ago
- machine learning with logical rules in Python ☆641 · Updated last year
- Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA, FAMD, GPA ☆1,376 · Updated 2 weeks ago