scikit-learn-contrib / sklearn-pandas
Pandas integration with sklearn
☆2,848 · Jun 8, 2023 · Updated 2 years ago
Alternatives and similar repositories for sklearn-pandas
Users who are interested in sklearn-pandas are comparing it to the libraries listed below.
- A library of extension and helper modules for Python's data analysis and machine learning libraries. ☆5,110 · Jan 24, 2026 · Updated 3 weeks ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning ☆7,078 · Feb 2, 2026 · Updated last week
- A library of sklearn compatible categorical variable encoders ☆2,479 · Jan 8, 2026 · Updated last month
- Hyper-parameter optimization for sklearn ☆1,643 · Apr 15, 2025 · Updated 9 months ago
- Automated Machine Learning with scikit-learn ☆8,048 · Jan 20, 2026 · Updated 3 weeks ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection. ☆4,393 · Feb 19, 2025 · Updated 11 months ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. ☆10,048 · Sep 11, 2025 · Updated 5 months ago
- An open source python library for automated feature engineering ☆7,610 · Feb 3, 2026 · Updated last week
- An intuitive library to add plotting functionality to scikit-learn objects. ☆2,434 · Aug 20, 2024 · Updated last year
- Sequential model-based optimization with a `scipy.optimize` interface ☆2,812 · Feb 23, 2024 · Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions ☆2,771 · Updated this week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames. ☆13,372 · Feb 2, 2026 · Updated last week
- Open source time series library for Python ☆2,138 · Oct 24, 2023 · Updated 2 years ago
- Python implementations of the Boruta all-relevant feature selection method. ☆1,619 · Nov 13, 2025 · Updated 3 months ago
- Automatic extraction of relevant features from time series ☆9,109 · Nov 15, 2025 · Updated 2 months ago
- Extra blocks for scikit-learn pipelines. ☆1,377 · Updated this week
- Describing statistical models in Python using symbolic formulas ☆977 · Jan 26, 2026 · Updated 2 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,357 · Updated this week
- A scikit-learn compatible neural network library that wraps PyTorch ☆6,149 · Dec 22, 2025 · Updated last month
- Parallel computing with task scheduling ☆13,738 · Feb 5, 2026 · Updated last week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆28,006 · Updated this week
- Declarative visualization library for Python ☆10,246 · Feb 6, 2026 · Updated last week
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models ☆489 · Aug 11, 2017 · Updated 8 years ago
- Lime: Explaining the predictions of any machine learning classifier ☆12,098 · Jul 25, 2024 · Updated last year
- NumPy and Pandas interface to Big Data ☆3,198 · Sep 29, 2023 · Updated 2 years ago
- Missing data visualization module for Python. ☆4,188 · May 14, 2024 · Updated last year
- Statsmodels: statistical modeling and econometrics in Python ☆11,239 · Jan 13, 2026 · Updated last month
- Distributed Asynchronous Hyperparameter Optimization in Python ☆7,619 · Updated this week
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis. ☆467 · Feb 27, 2025 · Updated 11 months ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning ☆3,169 · Aug 30, 2021 · Updated 4 years ago
- A python library for decision tree visualization and model interpretation. ☆3,124 · Jan 2, 2026 · Updated last month
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … ☆18,079 · Feb 8, 2026 · Updated last week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml ☆241 · Oct 13, 2018 · Updated 7 years ago
- A high performance implementation of HDBSCAN clustering. ☆3,059 · Jan 26, 2026 · Updated 2 weeks ago
- Bayesian Modeling and Probabilistic Programming in Python ☆9,476 · Updated this week
- Plotting library for IPython/Jupyter notebooks ☆3,682 · Jan 23, 2026 · Updated 3 weeks ago
- Scalable Machine Learning with Dask ☆944 · Sep 27, 2025 · Updated 4 months ago
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma… ☆8,797 · Updated this week
- A game theoretic approach to explain the output of any machine learning model. ☆25,023 · Updated this week