skrub-data / skrub
Prepping tables for machine learning
☆1,322Updated this week
Alternatives and similar repositories for skrub:
Users that are interested in skrub are comparing it to the libraries listed below
- Extra blocks for scikit-learn pipelines.☆1,312Updated this week
- Natural Intelligence is still a pretty good idea.☆806Updated 8 months ago
- Feature engineering package with sklearn like functionality☆2,012Updated this week
- Doubt your data, find bad labels.☆509Updated 8 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆696Updated last week
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,350Updated this week
- A Python package for causal inference in quasi-experimental settings☆958Updated this week
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆499Updated last month
- Fast SHAP value computation for interpreting tree-based models☆535Updated last year
- Data Analysis Baseline Library☆726Updated 3 months ago
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,441Updated 2 weeks ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆630Updated last month
- EvalML is an AutoML library written in python.☆804Updated this week
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,402Updated this week
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆443Updated this week
- Lightweight and extensible compatibility layer between dataframe libraries!☆884Updated this week
- skops is a Python library helping you share your scikit-learn based models and put them in production☆467Updated 2 weeks ago
- the scikit-learn sidekick☆341Updated this week
- A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.☆1,958Updated 2 weeks ago
- BAyesian Model-Building Interface (Bambi) in Python.☆1,141Updated 3 weeks ago
- Algorithms for outlier, adversarial and drift detection☆2,322Updated this week
- A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques.☆468Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆277Updated this week
- Predictive Power Score (PPS) in Python☆1,132Updated 2 months ago
- machine learning with logical rules in Python☆630Updated last year
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values.☆610Updated last year
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆781Updated 11 months ago
- A light-weight, flexible, and expressive statistical data testing library☆3,688Updated 2 weeks ago
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,084Updated 8 months ago
- Time should be taken seer-iously☆314Updated 2 years ago