getindata / quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for quickstart-ml-blueprints
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 2 months ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆29Updated last year
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆24Updated 4 months ago
- A few end to end examples that use data-describe☆16Updated last year
- Render Jupyter Notebooks With Metaflow Cards☆24Updated last month
- This repository contains code to build an MVP search engine with google like interface.☆16Updated 4 years ago
- NiFi Processor for Apache Pulsar☆10Updated 2 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆111Updated last week
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 8 months ago
- rust-for-data☆43Updated last year
- Source code for the post Effortless deployments with MLFlow, showcasing how logging models using MLFLow can provide you want to easily de…☆16Updated last year
- Apache Spark based framework for analysis A/B experiments☆11Updated 2 weeks ago
- ☆28Updated last month
- Guide to data platforms and tools☆31Updated 2 years ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- Batteries included toolkit for data engineering.☆32Updated this week
- Receipes of publicly-available Jupyter images☆8Updated last month
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆17Updated this week
- Events about the open source data stack☆13Updated 2 years ago
- ☆13Updated 9 months ago
- Example project using DBT, Databricks and AdventureWorks sample database☆10Updated 2 years ago
- Kedro Plugin to support running pipelines on AWS SageMaker.☆19Updated 11 months ago
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆37Updated 2 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆63Updated last month
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆32Updated last year
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated this week
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines.☆20Updated 2 years ago