Scikit-learn is an open-source machine learning library for Python that makes a wide range of machine learning algorithms available with little code. It provides simple and efficient tools for data mining and data analysis and is built on NumPy, SciPy, and Matplotlib. The library covers supervised and unsupervised learning, including regression, classification, clustering, and dimensionality reduction. Its straightforward, consistent syntax suits both beginners and experienced developers and supports rapid prototyping and deployment of machine learning models. Scikit-learn also includes utilities for model selection, validation, and optimization, such as GridSearchCV for hyperparameter tuning. Because it integrates with the broader Python data ecosystem, it is a common choice for application developers who want to add machine learning functionality to their projects.
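As a rough illustration of the hyperparameter tuning mentioned above, the following minimal sketch runs GridSearchCV over a small pipeline. The dataset (the bundled iris data), the SVC estimator, and the parameter grid are assumptions chosen only for illustration; they are not prescribed by this page.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Illustrative data: the small iris dataset that ships with scikit-learn.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Scale the features, then fit a support vector classifier.
pipeline = make_pipeline(StandardScaler(), SVC())

# Hypothetical grid: make_pipeline names each step after its lowercased
# class name, so SVC parameters are addressed with the "svc__" prefix.
param_grid = {
    "svc__C": [0.1, 1, 10],
    "svc__kernel": ["linear", "rbf"],
}

# 5-fold cross-validated search over every combination in the grid.
search = GridSearchCV(pipeline, param_grid, cv=5)
search.fit(X_train, y_train)

# By default the best combination is refit on the full training split,
# so the search object can score held-out data directly.
test_accuracy = search.score(X_test, y_test)
print("Best parameters:", search.best_params_)
print("Held-out accuracy:", round(test_accuracy, 3))
```

Wrapping the scaler and the classifier in one pipeline means the scaler is refit inside each cross-validation fold, which avoids leaking statistics from the validation folds into training.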
View the most prominent open source scikit-learn projects in the list below. Click on a specific project to view its alternative or complementary packages. Make comparisons and find the best package for your app.
- 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all · ☆85,063 · Updated this week
- 100 Days of ML Coding · ☆50,270 · Dec 29, 2023 · Updated 2 years ago
- Python Data Science Handbook: full text in Jupyter Notebooks · ☆47,197 · Jun 26, 2024 · Updated last year
- AiLearning: data analysis + machine learning in practice + linear algebra + PyTorch + NLTK + TF2 · ☆42,161 · Nov 12, 2024 · Updated last year
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,… · ☆28,983 · Mar 20, 2024 · Updated 2 years ago
- ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 or handson-mlp instead. · ☆25,807 · Mar 19, 2026 · Updated 3 weeks ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. · ☆23,399 · Apr 2, 2026 · Updated last week
- Open standard for machine learning interoperability · ☆20,584 · Apr 3, 2026 · Updated last week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator · ☆19,779 · Updated this week
- Your new Mentor for Data Science E-Learning. · ☆14,318 · Oct 14, 2025 · Updated 5 months ago
- Parallel computing with task scheduling · ☆13,784 · Apr 2, 2026 · Updated last week
- The "Python Machine Learning (1st edition)" book code repository and info resource · ☆12,605 · Nov 20, 2024 · Updated last year
- Open Machine Learning Course · ☆10,533 · Mar 1, 2026 · Updated last month
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. · ☆10,048 · Sep 11, 2025 · Updated 6 months ago
- Fast and Accurate ML in 3 Lines of Code · ☆10,201 · Updated this week
- A unified framework for machine learning with time series · ☆9,688 · Updated this week
- My blogs and code for machine learning. http://cnblogs.com/pinard · ☆8,702 · Feb 16, 2024 · Updated 2 years ago
- Automated Machine Learning with scikit-learn · ☆8,078 · Jan 20, 2026 · Updated 2 months ago
- An open source python library for automated feature engineering · ☆7,628 · Feb 3, 2026 · Updated 2 months ago
- The "Python Machine Learning (2nd edition)" book code repository and info resource · ☆7,206 · Oct 1, 2020 · Updated 5 years ago
- Fit interpretable models. Explain blackbox machine learning. · ☆6,822 · Mar 26, 2026 · Updated 2 weeks ago
- A scikit-learn compatible neural network library that wraps PyTorch · ☆6,151 · Mar 27, 2026 · Updated 2 weeks ago
- A curated list of project tutorials for project-based learning. · ☆5,735 · Nov 15, 2025 · Updated 4 months ago
- 🍊 Orange: Interactive data analysis · ☆5,596 · Apr 3, 2026 · Updated last week
- [Translation] scikit-learn (sklearn) Chinese documentation · ☆5,234 · Jul 21, 2023 · Updated 2 years ago
- Code Repository for Machine Learning with PyTorch and Scikit-Learn · ☆5,099 · Jan 2, 2026 · Updated 3 months ago
- The "Python Machine Learning (3rd edition)" book code repository · ☆5,009 · Apr 19, 2023 · Updated 2 years ago
- A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques. · ☆4,996 · Sep 22, 2023 · Updated 2 years ago
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T · ☆4,548 · Apr 3, 2026 · Updated last week
- Visual analysis and diagnostic tools to facilitate machine learning model selection. · ☆4,397 · Feb 19, 2025 · Updated last year
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP. · ☆4,316 · Updated this week
- A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep … · ☆3,982 · Mar 19, 2025 · Updated last year
- Jupyter notebooks from the scikit-learn video series · ☆3,783 · Mar 5, 2024 · Updated 2 years ago
- 🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com. · ☆3,676 · Jun 28, 2024 · Updated last year
- A collection of scientific methods, processes, algorithms, and systems to build stories & models. · ☆3,665 · Nov 7, 2025 · Updated 5 months ago
- 🛠 All-in-one web-based IDE specialized for machine learning and data science. · ☆3,542 · Jul 26, 2024 · Updated last year
- Hummingbird compiles trained ML models into tensor computation for faster inference. · ☆3,532 · Jul 17, 2025 · Updated 8 months ago
- Seamlessly integrate LLMs into scikit-learn. · ☆3,487 · Apr 1, 2026 · Updated last week
- High-Performance Symbolic Regression in Python and Julia · ☆3,463 · Apr 2, 2026 · Updated last week