Machine learning with dataframes
☆1,597Apr 23, 2026Updated last week
Alternatives and similar repositories for skrub
Users that are interested in skrub are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Track your Data Science. Skore's open-source Python library accelerates ML model development with automated evaluation reports, smart met…☆635Updated this week
- Extra blocks for scikit-learn pipelines.☆1,392Apr 21, 2026Updated last week
- Competing Risks and Survival Analysis☆117Sep 23, 2025Updated 7 months ago
- skops is a Python library helping you share your scikit-learn based models and put them in production☆513Apr 20, 2026Updated last week
- just a bunch of useful embeddings for scikit-learn pipelines☆524Feb 12, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Feature engineering and selection open-source Python library compatible with sklearn.☆2,233Mar 28, 2026Updated last month
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,541Updated this week
- A library of sklearn compatible categorical variable encoders☆2,489Mar 1, 2026Updated 2 months ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,597Updated this week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,398Feb 19, 2025Updated last year
- A unified framework for machine learning with time series☆9,745Updated this week
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,585Apr 13, 2026Updated 2 weeks ago
- Repository for CARTE: Context-Aware Representation of Table Entries☆170Aug 11, 2025Updated 8 months ago
- Data Analysis Baseline Library☆728Dec 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Doubt your data, find bad labels.☆516Jul 15, 2024Updated last year
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,101Apr 13, 2026Updated 2 weeks ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,138Jan 24, 2026Updated 3 months ago
- 📊 Explain why metrics change by unpacking them☆41Jan 16, 2026Updated 3 months ago
- Natural Intelligence is still a pretty good idea.☆831Mar 9, 2026Updated last month
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,764Updated this week
- Similarity encoding of dirty categorical variables (strings)☆20Jan 22, 2019Updated 7 years ago
- 🌊 Online machine learning in Python☆5,799Updated this week
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,169Apr 24, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A light-weight, flexible, and expressive statistical data testing library☆4,317Updated this week
- 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models☆3,208Updated this week
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆677Feb 19, 2025Updated last year
- EvalML is an AutoML library written in python.☆846Jan 14, 2026Updated 3 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,489Apr 11, 2026Updated 2 weeks ago
- machine learning with logical rules in Python☆659Jan 31, 2024Updated 2 years ago
- Survival analysis built on top of scikit-learn☆1,294Apr 7, 2026Updated 3 weeks ago
- TabICLv2: A state-of-the-art tabular foundation model☆810Updated this week
- An open source python library for automated feature engineering☆7,630Feb 3, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Python package to assess and improve fairness of machine learning models.☆2,230Apr 20, 2026Updated last week
- Statistical package in Python based on Pandas☆1,908Apr 5, 2026Updated 3 weeks ago
- STUMPY is a powerful and scalable Python library for modern time series analysis☆4,083Apr 4, 2026Updated 3 weeks ago
- Fit interpretable models. Explain blackbox machine learning.☆6,840Updated this week
- Gain clues from clustering!☆322Jul 16, 2024Updated last year
- A scikit-learn compatible neural network library that wraps PyTorch☆6,159Updated this week
- KEN: Relational Data Embeddings☆34Jan 2, 2024Updated 2 years ago