Feature engineering and selection open-source Python library compatible with sklearn.
☆2,204Updated this week
Alternatives and similar repositories for feature_engine
Users that are interested in feature_engine are comparing it to the libraries listed below
Sorting:
- Extra blocks for scikit-learn pipelines.☆1,379Feb 12, 2026Updated 2 weeks ago
- Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…☆677Feb 19, 2025Updated last year
- An open source python library for automated feature engineering☆7,614Feb 3, 2026Updated 3 weeks ago
- EvalML is an AutoML library written in python.☆845Jan 14, 2026Updated last month
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,573Feb 4, 2026Updated 3 weeks ago
- A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.☆1,519Feb 19, 2026Updated last week
- A Tree based feature selection tool which combines both the Boruta feature selection algorithm with shapley values.☆646Feb 19, 2024Updated 2 years ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,395Feb 19, 2025Updated last year
- Linear Prediction Model with Automated Feature Engineering and Selection Capabilities☆537Jan 6, 2026Updated last month
- A unified framework for machine learning with time series☆9,544Feb 20, 2026Updated last week
- Python implementations of the Boruta all-relevant feature selection method.☆1,620Nov 13, 2025Updated 3 months ago
- 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models☆3,147Feb 6, 2026Updated 3 weeks ago
- Automatic extraction of relevant features from time series:☆9,119Nov 15, 2025Updated 3 months ago
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,116Jan 24, 2026Updated last month
- An open-source, low-code machine learning library in Python☆9,700Apr 21, 2025Updated 10 months ago
- Machine learning with dataframes☆1,568Updated this week
- A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.☆2,034Jun 5, 2025Updated 8 months ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,979Dec 28, 2025Updated last month
- Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ra…☆765Aug 20, 2024Updated last year
- Time series forecasting with machine learning models☆1,445Updated this week
- Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation☆3,243Jul 7, 2025Updated 7 months ago
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,698Updated this week
- A Python library that helps data scientists to infer causation rather than observing correlation.☆2,439Jun 26, 2024Updated last year
- 🌊 Online machine learning in Python☆5,726Feb 9, 2026Updated 2 weeks ago
- Algorithms for outlier, adversarial and drift detection☆2,493Dec 11, 2025Updated 2 months ago
- Automated Time Series Forecasting☆1,373Feb 20, 2026Updated last week
- Calculates various features from time series data. Python implementation of the R package tsfeatures.☆440Apr 23, 2024Updated last year
- A python package for simultaneous Hyperparameters Tuning and Features Selection for Gradient Boosting Models.☆585Jun 8, 2024Updated last year
- Scalable machine 🤖 learning for time series forecasting.☆1,171Updated this week
- A python library for user-friendly forecasting and anomaly detection on time series.☆9,224Updated this week
- The machine learning toolkit for time series analysis in Python☆3,104Feb 19, 2026Updated last week
- STUMPY is a powerful and scalable Python library for modern time series analysis☆4,065Updated this week
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,082Feb 2, 2026Updated 3 weeks ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,389Feb 2, 2026Updated 3 weeks ago
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.☆4,286Updated this week
- Algorithms for explaining machine learning models☆2,610Oct 17, 2025Updated 4 months ago
- Natural Gradient Boosting for Probabilistic Prediction☆1,833Updated this week
- Fit interpretable models. Explain blackbox machine learning.☆6,802Updated this week
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆7,227Updated this week