Machine learning with dataframes
☆1,568Feb 23, 2026Updated last week
Alternatives and similar repositories for skrub
Users that are interested in skrub are comparing it to the libraries listed below
Sorting:
- Extra blocks for scikit-learn pipelines.☆1,379Feb 12, 2026Updated 2 weeks ago
- Track your Data Science. Skore's open-source Python library accelerates ML model development with automated evaluation reports, smart met…☆589Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines☆522Feb 12, 2026Updated 2 weeks ago
- skops is a Python library helping you share your scikit-learn based models and put them in production☆512Feb 2, 2026Updated last month
- Competing Risks and Survival Analysis☆115Sep 23, 2025Updated 5 months ago
- Feature engineering and selection open-source Python library compatible with sklearn.☆2,204Updated this week
- A library of sklearn compatible categorical variable encoders☆2,482Updated this week
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,520Updated this week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,395Feb 19, 2025Updated last year
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,573Feb 4, 2026Updated 3 weeks ago
- A unified framework for machine learning with time series☆9,544Feb 20, 2026Updated last week
- Data Analysis Baseline Library☆727Dec 16, 2024Updated last year
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,533Updated this week
- Natural Intelligence is still a pretty good idea.☆826Jul 15, 2024Updated last year
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- 🌊 Online machine learning in Python☆5,726Feb 9, 2026Updated 3 weeks ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,119Jan 24, 2026Updated last month
- A light-weight, flexible, and expressive statistical data testing library☆4,212Feb 19, 2026Updated last week
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,085Feb 2, 2026Updated last month
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,166Oct 4, 2025Updated 4 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,480Updated this week
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,698Updated this week
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆677Feb 19, 2025Updated last year