IBM / data-science-best-practices
The goal of this repository is to enable data scientists and ML engineers to develop data science use cases and making it ready for production use. This means focusing on the versioning, scalability, monitoring and engineering of the solution.
☆90Updated last year
Alternatives and similar repositories for data-science-best-practices:
Users that are interested in data-science-best-practices are comparing it to the libraries listed below
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆81Updated last year
- ☆84Updated 2 years ago
- MLOps maturity assessment☆61Updated 2 years ago
- Engineering MLOps, published by Packt☆184Updated 2 years ago
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases wit…☆48Updated 3 years ago
- Template repository for data science lifecycle project☆192Updated 4 years ago
- Demo repository implementing an end-to-end MLOps workflow on Databricks, using Azure DevOps for CICD orchestration. Project derived from …☆29Updated 2 years ago
- IBM watsonx.ai sample models, notebooks and apps.☆130Updated last week
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆182Updated 9 months ago
- Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in productio…☆83Updated last year
- ☆38Updated 2 years ago
- Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.☆214Updated 2 years ago
- An example MLFlow project☆48Updated 3 months ago
- Templates for your Kedro projects.☆73Updated 3 weeks ago
- Mastering Azure Machine Learning, published by packt☆51Updated 2 years ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆130Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆44Updated 2 years ago
- Code base for programming projects☆55Updated 2 months ago
- End to end MLRun demos☆92Updated 2 weeks ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.☆114Updated last year
- MLApp is a Python library for building scalable data science solutions that meet modern software engineering standards.☆44Updated 3 years ago
- ☆30Updated 2 years ago
- Machine Learning Engineering with MLflow, published by Packt☆114Updated 9 months ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- Reference architecture for machine learning operations☆37Updated 4 years ago
- ☆27Updated last year
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineering☆40Updated 5 months ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago