aws / amazon-mwaa-docker-images
☆26 Updated this week
Related projects
Alternatives and complementary repositories for amazon-mwaa-docker-images
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift ☆19 Updated 3 weeks ago
- Utility functions for dbt projects running on Spark ☆31 Updated last year
- Build DataOps platform with Apache Airflow and dbt on AWS ☆51 Updated 3 years ago
- simplify working with DataHub API endpoints ☆40 Updated last week
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆19 Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆19 Updated 2 months ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆25 Updated last year
- Curated list of resources about Apache Airflow ☆19 Updated 3 years ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆106 Updated 2 months ago
- Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce. ☆38 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆43 Updated 3 years ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆18 Updated 2 months ago
- The dbt adapter for Firebolt ☆29 Updated last month
- AWS Quick Start Team ☆18 Updated last month
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work ☆47 Updated 2 years ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆151 Updated 2 weeks ago
- Demo for GitHub Universe 2022 ☆12 Updated last year
- Unity Catalog UI ☆39 Updated 2 months ago
- In this repository, we show how to get started with data lineage on AWS using OpenLineage. This is an AWS Cloud Development Kit project (… ☆12 Updated 3 months ago
- A bunch of hacks developed around dbt ☆48 Updated 5 years ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer ☆137 Updated last week
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 Updated last year
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran ☆20 Updated 2 weeks ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs ☆38 Updated 6 months ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role. ☆20 Updated last year
- Delta Lake Documentation ☆46 Updated 4 months ago
- RStoolKit - A utility to perform a complete health check of your AWS RedShift Cluster ☆23 Updated 4 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 Updated last year