Machine learning with dataframes
☆1,576Mar 6, 2026Updated this week
Alternatives and similar repositories for skrub
Users that are interested in skrub are comparing it to the libraries listed below
Sorting:
- Extra blocks for scikit-learn pipelines.☆1,382Mar 1, 2026Updated last week
- Track your Data Science. Skore's open-source Python library accelerates ML model development with automated evaluation reports, smart met…☆599Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines☆523Feb 12, 2026Updated last month
- skops is a Python library helping you share your scikit-learn based models and put them in production☆512Feb 2, 2026Updated last month
- Competing Risks and Survival Analysis☆116Sep 23, 2025Updated 5 months ago
- Feature engineering and selection open-source Python library compatible with sklearn.☆2,211Updated this week
- A library of sklearn compatible categorical variable encoders☆2,486Mar 1, 2026Updated last week
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,523Mar 5, 2026Updated last week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,396Feb 19, 2025Updated last year
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,573Feb 24, 2026Updated 2 weeks ago
- A unified framework for machine learning with time series☆9,609Updated this week
- Data Analysis Baseline Library☆728Dec 16, 2024Updated last year
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,554Updated this week
- Natural Intelligence is still a pretty good idea.☆826Jul 15, 2024Updated last year
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- 🌊 Online machine learning in Python☆5,746Updated this week
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,119Jan 24, 2026Updated last month
- A light-weight, flexible, and expressive statistical data testing library☆4,218Feb 19, 2026Updated 3 weeks ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,087Feb 2, 2026Updated last month
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,167Oct 4, 2025Updated 5 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,485Mar 4, 2026Updated last week
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,716Updated this week
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆677Feb 19, 2025Updated last year
- 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models☆3,151Feb 6, 2026Updated last month
- STUMPY is a powerful and scalable Python library for modern time series analysis☆4,070Feb 22, 2026Updated 2 weeks ago
- Statistical package in Python based on Pandas☆1,881Updated this week
- machine learning with logical rules in Python☆658Jan 31, 2024Updated 2 years ago
- EvalML is an AutoML library written in python.☆845Jan 14, 2026Updated last month
- Fit interpretable models. Explain blackbox machine learning.☆6,803Mar 5, 2026Updated last week
- Survival analysis built on top of scikit-learn☆1,277Updated this week
- Scalable machine 🤖 learning for time series forecasting.☆1,185Updated this week
- An open source python library for automated feature engineering☆7,621Feb 3, 2026Updated last month
- Gain clues from clustering!☆320Jul 16, 2024Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,773Feb 10, 2026Updated last month
- DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a uni…☆7,985Mar 5, 2026Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,411Mar 3, 2026Updated last week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,492Mar 1, 2026Updated last week
- Repository for CARTE: Context-Aware Representation of Table Entries☆166Aug 11, 2025Updated 7 months ago
- A scikit-learn compatible neural network library that wraps PyTorch☆6,151Feb 25, 2026Updated 2 weeks ago