skrub-data / skrubLinks
Machine learning with dataframes
☆1,437Updated this week
Alternatives and similar repositories for skrub
Users that are interested in skrub are comparing it to the libraries listed below
Sorting:
- Extra blocks for scikit-learn pipelines.☆1,350Updated last month
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,443Updated this week
- Feature engineering package with sklearn like functionality☆2,101Updated 3 weeks ago
- Natural Intelligence is still a pretty good idea.☆818Updated last year
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,485Updated last month
- 𝗢𝘄𝗻 𝗬𝗼𝘂𝗿 𝗗𝗮𝘁𝗮 𝗦𝗰𝗶𝗲𝗻𝗰𝗲. Skore's open-source Python library accelerates ML model development with automated evaluation re…☆514Updated this week
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆701Updated last week
- Fast SHAP value computation for interpreting tree-based models☆541Updated 2 years ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆660Updated 5 months ago
- Predictive Power Score (PPS) in Python☆1,152Updated 6 months ago
- Doubt your data, find bad labels.☆515Updated last year
- A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques.☆467Updated last year
- skops is a Python library helping you share your scikit-learn based models and put them in production☆494Updated this week
- Leave One Feature Out Importance☆836Updated 5 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆504Updated 6 months ago
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,124Updated last year
- Multivariate exploratory data analysis in Python — PCA, CA, MCA, MFA, FAMD, GPA☆1,379Updated last month
- EvalML is an AutoML library written in python.☆820Updated last week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆301Updated 3 months ago
- A Python package for causal inference in quasi-experimental settings☆1,026Updated this week
- A set of data tools in Python☆504Updated 6 months ago
- Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applicati…☆695Updated last year
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆466Updated 2 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,415Updated last month
- A scikit-learn-compatible module for comparing imputation methods.☆137Updated 2 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,440Updated this week
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆786Updated last year
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values.☆625Updated last year
- Statistical package in Python based on Pandas☆1,809Updated 4 months ago
- Multiple Imputation with LightGBM in Python☆384Updated last year