skrub-data / skrub
Prepping tables for machine learning
☆1,297Updated this week
Alternatives and similar repositories for skrub:
Users that are interested in skrub are comparing it to the libraries listed below
- Extra blocks for scikit-learn pipelines.☆1,306Updated last month
- Feature engineering package with sklearn like functionality☆1,987Updated 2 weeks ago
- A scikit-learn-compatible module to estimate prediction intervals and control risks based on conformal predictions.☆1,337Updated this week
- Fast SHAP value computation for interpreting tree-based models☆533Updated last year
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆691Updated last month
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,425Updated this week
- Doubt your data, find bad labels.☆508Updated 7 months ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆603Updated 3 weeks ago
- Predictive Power Score (PPS) in Python☆1,122Updated last month
- EvalML is an AutoML library written in python.☆803Updated this week
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,081Updated 7 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,396Updated this week
- Natural Intelligence is still a pretty good idea.☆801Updated 7 months ago
- skops is a Python library helping you share your scikit-learn based models and put them in production☆465Updated this week
- A Python package for causal inference in quasi-experimental settings☆951Updated last week
- Leave One Feature Out Importance☆827Updated this week
- Statistical package in Python based on Pandas☆1,686Updated 2 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,354Updated last month
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values.☆606Updated last year
- Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applicati…☆686Updated 6 months ago
- machine learning with logical rules in Python☆629Updated last year
- python partial dependence plot toolbox☆850Updated 5 months ago
- Data Analysis Baseline Library☆728Updated 2 months ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆820Updated this week
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆274Updated last month
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆777Updated 10 months ago
- Pandas DataFrames as Interactive DataTables☆830Updated last week
- Algorithms for outlier, adversarial and drift detection☆2,302Updated 3 weeks ago
- Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.☆630Updated last month
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆498Updated 3 weeks ago