garystafford / tickit-data-lake-demo
Resources for video demonstrations and blog posts related to DataOps on AWS
☆176Updated 3 years ago
Alternatives and similar repositories for tickit-data-lake-demo
Users that are interested in tickit-data-lake-demo are comparing it to the libraries listed below
Sorting:
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati…☆115Updated 5 months ago
- Code snippets for Data Engineering Design Patterns book☆106Updated last month
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- This repository contains the dbt-glue adapter☆120Updated last week
- Spark runtime on AWS Lambda☆107Updated 7 months ago
- Docker with Airflow and Spark standalone cluster☆257Updated last year
- ☆130Updated 3 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆164Updated 4 months ago
- Simple stream processing pipeline☆102Updated 10 months ago
- Code for dbt tutorial☆157Updated 11 months ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆65Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆261Updated 10 months ago
- Demo for GitHub Universe 2022☆12Updated 2 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆33Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆116Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆114Updated last month
- A repository of sample code to accompany our blog post on Airflow and dbt.☆172Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- Materials for the next course☆24Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆144Updated 4 years ago
- Sample project to demonstrate data engineering best practices☆190Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆219Updated 2 weeks ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆45Updated 5 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆102Updated 4 years ago
- ☆65Updated 2 weeks ago
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow☆52Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆162Updated 3 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago