garystafford / tickit-data-lake-demo
Resources for video demonstrations and blog posts related to DataOps on AWS
☆172 Updated 3 years ago
Alternatives and similar repositories for tickit-data-lake-demo:
Users who are interested in tickit-data-lake-demo are comparing it to the repositories listed below.
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆112 Updated 3 months ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆114 Updated 2 years ago
- Code for dbt tutorial ☆151 Updated 9 months ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆40 Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆167 Updated last year
- This repository contains the dbt-glue adapter ☆109 Updated this week
- Code snippets for Data Engineering Design Patterns book ☆73 Updated 3 weeks ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆161 Updated last month
- Docker with Airflow and Spark standalone cluster ☆251 Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆256 Updated 7 months ago
- Sample project to demonstrate data engineering best practices ☆179 Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆170 Updated last year
- Docker Airflow - Contains a docker compose file for Airflow 2.0 ☆65 Updated 2 years ago
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos ☆56 Updated 4 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆127 Updated 7 months ago
- Cloned by the `dbt init` task ☆61 Updated 10 months ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow. ☆192 Updated 3 weeks ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 Updated 2 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects ☆42 Updated 3 months ago
- Spark runtime on AWS Lambda ☆106 Updated 5 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆210 Updated last week
- Data pipeline with dbt, Airflow, Great Expectations ☆161 Updated 3 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆234 Updated 3 weeks ago
- ☆34 Updated 2 years ago
- Airflow training for the crunch conf ☆105 Updated 6 years ago
- ☆122 Updated 2 weeks ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆42 Updated 2 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco. ☆139 Updated 4 years ago
- Project for "Data pipeline design patterns" blog. ☆44 Updated 6 months ago
- build dw with dbt ☆37 Updated 4 months ago