Pandas integration with sklearn
☆2,852Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for sklearn-pandas
Users that are interested in sklearn-pandas are comparing it to the libraries listed below
Sorting:
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,119Jan 24, 2026Updated last month
- A library of sklearn compatible categorical variable encoders☆2,484Updated this week
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆7,085Feb 2, 2026Updated last month
- Hyper-parameter optimization for sklearn☆1,646Apr 15, 2025Updated 10 months ago
- Automated Machine Learning with scikit-learn☆8,062Jan 20, 2026Updated last month
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,396Feb 19, 2025Updated last year
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆10,046Sep 11, 2025Updated 5 months ago
- An open source python library for automated feature engineering☆7,617Feb 3, 2026Updated last month
- An intuitive library to add plotting functionality to scikit-learn objects.☆2,432Aug 20, 2024Updated last year
- Sequential model-based optimization with a `scipy.optimize` interface☆2,815Feb 23, 2024Updated 2 years ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,772Feb 10, 2026Updated 3 weeks ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,399Feb 27, 2026Updated last week
- Open source time series library for Python☆2,140Oct 24, 2023Updated 2 years ago
- Python implementations of the Boruta all-relevant feature selection method.☆1,621Nov 13, 2025Updated 3 months ago
- Automatic extraction of relevant features from time series:☆9,127Nov 15, 2025Updated 3 months ago
- Extra blocks for scikit-learn pipelines.☆1,382Updated this week
- Describing statistical models in Python using symbolic formulas☆978Feb 23, 2026Updated last week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated 3 weeks ago
- A scikit-learn compatible neural network library that wraps PyTorch☆6,152Feb 25, 2026Updated last week
- Parallel computing with task scheduling☆13,754Updated this week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆28,066Updated this week
- Declarative visualization library for Python☆10,276Feb 27, 2026Updated last week
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆489Aug 11, 2017Updated 8 years ago
- Lime: Explaining the predictions of any machine learning classifier☆12,101Jul 25, 2024Updated last year
- NumPy and Pandas interface to Big Data☆3,196Sep 29, 2023Updated 2 years ago
- Missing data visualization module for Python.☆4,196May 14, 2024Updated last year
- Statsmodels: statistical modeling and econometrics in Python☆11,279Updated this week
- Distributed Asynchronous Hyperparameter Optimization in Python☆7,607Feb 8, 2026Updated 3 weeks ago
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆467Feb 27, 2025Updated last year
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning☆3,169Aug 30, 2021Updated 4 years ago
- A python library for decision tree visualization and model interpretation.☆3,125Jan 2, 2026Updated 2 months ago
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆18,124Updated this week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- A high performance implementation of HDBSCAN clustering.☆3,077Jan 26, 2026Updated last month
- Bayesian Modeling and Probabilistic Programming in Python☆9,514Updated this week
- Plotting library for IPython/Jupyter notebooks☆3,683Jan 23, 2026Updated last month
- Scalable Machine Learning with Dask☆945Sep 27, 2025Updated 5 months ago
- A game theoretic approach to explain the output of any machine learning model.☆25,079Updated this week
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma…☆8,825Updated this week