zzstoatzz / oreilly-workflow-orchestration
☆26Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for oreilly-workflow-orchestration
- A very simple "hello world" project for deploying Prefect 2 to a docker container on Google Compute Engine.☆11Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago
- ☆25Updated 2 years ago
- Collection of code snippets for blogs, conferences, and talks☆23Updated 2 years ago
- csv and flat-file sniffer built in Rust.☆42Updated 9 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆48Updated last year
- Code examples showing flow deployment to various types of infrastructure☆102Updated last year
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 6 months ago
- An example MLFlow project☆48Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- ☆83Updated last year
- The easiest way to integrate Kedro and Great Expectations☆53Updated last year
- Templates for your Kedro projects.☆67Updated this week
- Deploy a Prefect flow to serverless AWS Lambda function☆36Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆174Updated this week
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆55Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆63Updated last month
- Prefect 2 flows☆11Updated 8 months ago
- Code snippets for Data Engineering Design Patterns book☆40Updated last week
- An abstraction layer for parameter tuning☆36Updated 2 months ago
- ☆29Updated 11 months ago
- ☆30Updated last year
- This is a repository for the Duke University Cloud Computing course project on Serveless Data Engineering Pipeline. For this project, I r…☆19Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆50Updated 3 months ago
- It's all in the name☆74Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆42Updated 8 months ago
- Prefect integrations with SQLAlchemy.☆25Updated 6 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 2 months ago
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆42Updated last year