garystafford / tickit-data-lake-demo
Resources for video demonstrations and blog posts related to DataOps on AWS
☆178 Updated 3 years ago
Alternatives and similar repositories for tickit-data-lake-demo
Users who are interested in tickit-data-lake-demo are comparing it to the repositories listed below.
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆116 Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless. ☆166 Updated 6 months ago
- Code snippets for Data Engineering Design Patterns book ☆127 Updated 3 months ago
- Data pipeline with dbt, Airflow, Great Expectations ☆163 Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆117 Updated 2 years ago
- This repository contains the dbt-glue adapter ☆127 Updated this week
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆174 Updated last year
- Code for dbt tutorial ☆156 Updated last month
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 Updated 2 years ago
- Spark runtime on AWS Lambda ☆108 Updated 9 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆169 Updated last year
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆41 Updated 3 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects ☆47 Updated 7 months ago
- Docker with Airflow and Spark standalone cluster ☆261 Updated last year
- Sample Airflow DAGs ☆62 Updated 2 years ago
- Build DataOps platform with Apache Airflow and dbt on AWS ☆57 Updated 4 years ago
- ☆134 Updated 5 months ago
- Lab Instructions for Data Engineering Immersion Day ☆190 Updated 5 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆219 Updated 2 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆90 Updated 3 years ago
- Pyspark boilerplate for running prod ready data pipeline ☆29 Updated 4 years ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam ☆124 Updated 3 weeks ago
- Simple stream processing pipeline ☆103 Updated last year
- ☆61 Updated 4 years ago
- Spark data pipeline that processes movie ratings data. ☆29 Updated 2 weeks ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 Updated 2 years ago
- ☆34 Updated 2 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging… ☆91 Updated 5 years ago
- Delta Lake examples ☆226 Updated 9 months ago