Scikit-learn is a robust, open-source machine learning library for Python, designed to simplify the implementation of a wide range of machine learning algorithms with minimal coding effort. It provides simple and efficient tools for data mining and data analysis, built atop NumPy, SciPy, and Matplotlib. The library covers a variety of supervised and unsupervised learning algorithms, including regression, classification, clustering, and dimensionality reduction. Its straightforward syntax promotes ease of use for both beginners and experienced developers, allowing for rapid prototyping and deployment of machine learning models. Scikit-learn also includes utilities for model selection, validation, and optimization, such as GridSearchCV for hyperparameter tuning. Its integration within the broader Python data ecosystem makes it an ideal choice for application developers looking to incorporate machine learning functionalities into their projects.
View the most prominent open source scikit-learn projects in the list below. Click on a specific project to view its alternative or complementary packages.
- 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all☆69,831Updated last week
- 100 Days of ML Coding☆45,496Updated 10 months ago
- Python Data Science Handbook: full text in Jupyter Notebooks☆43,247Updated 4 months ago
- AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2☆39,570Updated last week
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…☆27,487Updated 8 months ago
- ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.☆25,202Updated last year
- Open standard for machine learning interoperability☆17,949Updated this week
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.☆18,068Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆14,722Updated this week
- Your new Mentor for Data Science E-Learning.☆13,982Updated 8 months ago
- Parallel computing with task scheduling☆12,604Updated this week
- The "Python Machine Learning (1st edition)" book code repository and info resource☆12,276Updated 7 months ago
- Open Machine Learning Course☆9,788Updated 2 months ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,739Updated 3 months ago
- My blogs and code for machine learning. http://cnblogs.com/pinard☆8,342Updated 9 months ago
- Fast and Accurate ML in 3 Lines of Code☆8,051Updated this week
- A unified framework for machine learning with time series☆7,957Updated this week
- Automated Machine Learning with scikit-learn☆7,637Updated this week
- An open source python library for automated feature engineering☆7,272Updated this week
- The "Python Machine Learning (2nd edition)" book code repository and info resource☆7,120Updated 4 years ago
- Fit interpretable models. Explain blackbox machine learning.☆6,297Updated this week
- A scikit-learn compatible neural network library that wraps PyTorch☆5,884Updated 2 weeks ago
- [译] scikit-learn(sklearn) 中文文档☆5,124Updated last year
- Flower: A Friendly Federated AI Framework☆5,140Updated this week
- 🍊 Orange: Interactive data analysis☆4,881Updated this week
- A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.☆4,635Updated last year
- The "Python Machine Learning (3rd edition)" book code repository☆4,610Updated last year
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,293Updated last month
- PipelineAI☆4,172Updated 7 months ago
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.☆3,926Updated this week
- A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep …☆3,765Updated 4 months ago
- A curated list of project tutorials for project-based learning.☆3,739Updated 2 weeks ago
- Jupyter notebooks from the scikit-learn video series☆3,675Updated 8 months ago
- A collection of scientific methods, processes, algorithms, and systems to build stories & models.☆3,541Updated last month
- Code Repository for Machine Learning with PyTorch and Scikit-Learn☆3,616Updated 2 weeks ago
- 🛠 All-in-one web-based IDE specialized for machine learning and data science.☆3,437Updated 3 months ago
- 🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.☆3,445Updated 4 months ago
- Seamlessly integrate LLMs into scikit-learn.☆3,374Updated last week
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,356Updated this week
- Practice and tutorial-style notebooks covering wide variety of machine learning techniques☆3,109Updated last year