skrub-data / skrub
Machine learning with dataframes
☆1,355Updated this week
Alternatives and similar repositories for skrub:
Users that are interested in skrub are comparing it to the libraries listed below
- Extra blocks for scikit-learn pipelines.☆1,318Updated this week
- Feature engineering package with sklearn like functionality☆2,035Updated this week
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,369Updated this week
- Natural Intelligence is still a pretty good idea.☆808Updated 9 months ago
- Predictive Power Score (PPS) in Python☆1,147Updated 3 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆697Updated last month
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,412Updated this week
- Fast SHAP value computation for interpreting tree-based models☆539Updated last year
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆639Updated 2 months ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆928Updated this week
- Data Analysis Baseline Library☆726Updated 4 months ago
- Doubt your data, find bad labels.☆511Updated 9 months ago
- nannyml: post-deployment data science in python☆2,050Updated 3 months ago
- Statistical package in Python based on Pandas☆1,731Updated last month
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆453Updated this week
- the scikit-learn sidekick☆395Updated this week
- A Python package for causal inference in quasi-experimental settings☆972Updated this week
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,442Updated last month
- just a bunch of useful embeddings for scikit-learn pipelines☆496Updated 3 weeks ago
- Data Quality assessment with one line of code☆437Updated 2 weeks ago
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews☆1,181Updated this week
- 🐦 Quickly annotate data from the comfort of your Jupyter notebook☆784Updated last year
- machine learning with logical rules in Python☆634Updated last year
- Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Gra…☆1,796Updated 10 months ago
- Easy to use Python library of customized functions for cleaning and analyzing data.☆509Updated 3 months ago
- skops is a Python library helping you share your scikit-learn based models and put them in production☆474Updated this week
- A template for scikit-learn extensions☆332Updated 2 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆500Updated 2 months ago
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆389Updated 3 months ago
- Algorithms for outlier, adversarial and drift detection☆2,345Updated 3 weeks ago