skrub-data / skrub
Prepping tables for machine learning
☆1,272 Updated this week
Alternatives and similar repositories for skrub:
Users who are interested in skrub are comparing it to the libraries listed below.
- Extra blocks for scikit-learn pipelines. ☆1,291 Updated this week
- Feature engineering package with sklearn like functionality ☆1,972 Updated this week
- Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible). ☆1,416 Updated last week
- Natural Intelligence is still a pretty good idea. ☆801 Updated 6 months ago
- A scikit-learn-compatible module to estimate prediction intervals and control risks based on conformal predictions. ☆1,316 Updated this week
- Statistical package in Python based on Pandas ☆1,668 Updated last month
- Fast SHAP value computation for interpreting tree-based models ☆527 Updated last year
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad… ☆596 Updated 8 months ago
- Draw datasets from within Jupyter. ☆830 Updated this week
- EvalML is an AutoML library written in python. ☆797 Updated this week
- Doubt your data, find bad labels. ☆508 Updated 6 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,383 Updated this week
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models. ☆2,348 Updated 2 weeks ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ☆690 Updated last week
- Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA, FAMD, GPA ☆1,309 Updated last week
- Data Analysis Baseline Library ☆728 Updated last month
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆461 Updated last month
- machine learning with logical rules in Python ☆625 Updated 11 months ago
- Monitor the stability of a Pandas or Spark dataframe ☆498 Updated 3 months ago
- Leave One Feature Out Importance ☆824 Updated last year
- A drop-in replacement for Scikit-Learn's GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques. ☆467 Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆268 Updated 3 weeks ago
- A Python package for causal inference in quasi-experimental settings ☆941 Updated this week
- Predictive Power Score (PPS) in Python ☆1,121 Updated last week
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values. ☆601 Updated 10 months ago
- A set of data tools in Python ☆499 Updated last week
- Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models ☆2,763 Updated last week
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console. ☆422 Updated this week
- Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Gra… ☆1,763 Updated 7 months ago
- nannyml: post-deployment data science in python ☆2,013 Updated this week