ricardolsmendes / aws-glue-ci-cd-blueprint
Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post
☆11Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for aws-glue-ci-cd-blueprint
- Code snippets for Data Engineering Design Patterns book☆40Updated last week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆62Updated last month
- Delta Lake Documentation☆46Updated 5 months ago
- This repository contains the dbt-glue adapter☆101Updated this week
- ☆66Updated last month
- Code to demonstrate data engineering metadata & logging best practices☆15Updated 8 months ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati…☆106Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless.☆153Updated this week
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆25Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆72Updated last year
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos☆46Updated last month
- Spark runtime on AWS Lambda☆97Updated 2 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆196Updated this week
- End to end data engineering project☆51Updated 2 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆189Updated this week
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆30Updated 6 months ago
- Some example projects for Data Engineers to build, end-to-end.☆27Updated last year
- ☆104Updated 3 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆37Updated last week
- New generation opensource data stack☆61Updated 2 years ago
- Quick Guides from Dremio on Several topics☆65Updated 3 weeks ago
- A Python Library to support running data quality rules while the spark job is running⚡☆163Updated last week
- Great Expectations Airflow operator☆159Updated 3 weeks ago
- Full stack data engineering tools and infrastructure set-up☆44Updated 3 years ago
- A custom end-to-end data pipeline for customer churn☆9Updated 3 weeks ago
- A terraform module that deploys Dagster to AWS, using ECS.☆29Updated 2 years ago
- Build DataOps platform with Apache Airflow and dbt on AWS☆51Updated 3 years ago