Scikit-learn is a robust, open-source machine learning library for Python, designed to simplify the implementation of a wide range of machine learning algorithms with minimal coding effort. It provides simple and efficient tools for data mining and data analysis, built atop NumPy, SciPy, and Matplotlib. The library covers a variety of supervised and unsupervised learning algorithms, including regression, classification, clustering, and dimensionality reduction. Its straightforward syntax promotes ease of use for both beginners and experienced developers, allowing for rapid prototyping and deployment of machine learning models. Scikit-learn also includes utilities for model selection, validation, and optimization, such as GridSearchCV for hyperparameter tuning. Its integration within the broader Python data ecosystem makes it an ideal choice for application developers looking to incorporate machine learning functionalities into their projects.
View the most prominent open source scikit-learn projects in the list below. Click on a specific project to view its alternative or complementary packages. Make comparisons and find the best package for your app.
- 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all☆84,015Updated this week
- 100 Days of ML Coding☆49,686Dec 29, 2023Updated 2 years ago
- Python Data Science Handbook: full text in Jupyter Notebooks☆46,850Jun 26, 2024Updated last year
- AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2☆42,051Nov 12, 2024Updated last year
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…☆28,896Mar 20, 2024Updated last year
- ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.☆25,882Oct 3, 2023Updated 2 years ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.☆23,250Updated this week
- Open standard for machine learning interoperability☆20,373Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,389Updated this week
- Your new Mentor for Data Science E-Learning.☆14,314Oct 14, 2025Updated 4 months ago
- Parallel computing with task scheduling☆13,746Updated this week
- The "Python Machine Learning (1st edition)" book code repository and info resource☆12,591Nov 20, 2024Updated last year
- Open Machine Learning Course☆10,452Jan 24, 2026Updated last month
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆10,049Sep 11, 2025Updated 5 months ago
- Fast and Accurate ML in 3 Lines of Code☆10,030Updated this week
- A unified framework for machine learning with time series☆9,544Feb 20, 2026Updated last week
- My blogs and code for machine learning. http://cnblogs.com/pinard☆8,693Feb 16, 2024Updated 2 years ago
- Automated Machine Learning with scikit-learn☆8,054Jan 20, 2026Updated last month
- An open source python library for automated feature engineering☆7,614Feb 3, 2026Updated 3 weeks ago
- The "Python Machine Learning (2nd edition)" book code repository and info resource☆7,195Oct 1, 2020Updated 5 years ago
- Fit interpretable models. Explain blackbox machine learning.☆6,802Updated this week
- Flower: A Friendly Federated AI Framework☆6,643Updated this week
- A scikit-learn compatible neural network library that wraps PyTorch☆6,150Feb 16, 2026Updated last week
- 🍊 Orange: Interactive data analysis☆5,558Updated this week
- A curated list of project tutorials for project-based learning.☆5,517Nov 15, 2025Updated 3 months ago
- [译] scikit-learn(sklearn) 中文文档☆5,235Jul 21, 2023Updated 2 years ago
- The "Python Machine Learning (3rd edition)" book code repository☆4,990Apr 19, 2023Updated 2 years ago
- A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.☆4,983Sep 22, 2023Updated 2 years ago
- Code Repository for Machine Learning with PyTorch and Scikit-Learn☆5,004Jan 2, 2026Updated last month
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,395Feb 19, 2025Updated last year
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T☆4,394Updated this week
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.☆4,286Updated this week
- A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep …☆3,977Mar 19, 2025Updated 11 months ago
- Jupyter notebooks from the scikit-learn video series☆3,778Mar 5, 2024Updated last year
- 🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.☆3,672Jun 28, 2024Updated last year
- A collection of scientific methods, processes, algorithms, and systems to build stories & models.☆3,663Nov 7, 2025Updated 3 months ago
- 🛠 All-in-one web-based IDE specialized for machine learning and data science.☆3,547Jul 26, 2024Updated last year
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,529Jul 17, 2025Updated 7 months ago
- Seamlessly integrate LLMs into scikit-learn.☆3,490Feb 1, 2026Updated 3 weeks ago
- High-Performance Symbolic Regression in Python and Julia☆3,378Updated this week