andresionek91 / airflow-fargate-cdk
Deploy of Airflow 2.0 using ECS Fargate and AWS CDK.
☆14 Updated 3 years ago
Alternatives and similar repositories for airflow-fargate-cdk:
Users that are interested in airflow-fargate-cdk are comparing it to the libraries listed below
- Docker image for AWS Glue Spark/Python ☆23 Updated last year
- Glue VSCode devcontainer setup ☆14 Updated 2 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 Updated 2 years ago
- ☆60 Updated 3 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated 2 years ago
- ☆16 Updated last year
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker… ☆84 Updated 2 years ago
- Constructs to deploy airflow via the aws cdk ☆27 Updated 4 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 Updated 2 years ago
- ☆30 Updated last year
- ☆73 Updated 10 months ago
- ☆23 Updated 6 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆35 Updated 2 months ago
- ☆53 Updated last year
- ☆12 Updated 2 years ago
- Helper library to run AWS Glue ETL scripts docker container for local testing of development in a Jupyter notebook ☆20 Updated last year
- ☆22 Updated 4 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 Updated 2 years ago
- Spark runtime on AWS Lambda ☆107 Updated 7 months ago
- Airflow Deployment on AWS ECS Fargate Using Cloudformation ☆204 Updated 2 years ago
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- Build DataOps platform with Apache Airflow and dbt on AWS ☆55 Updated 3 years ago
- Spark env to Glue development ☆9 Updated 3 years ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 Updated last year
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆75 Updated 6 years ago
- Repo with scripts and automation to help ensure best practices in Google Data Catalog ☆13 Updated 3 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook. ☆17 Updated 2 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster… ☆10 Updated 4 years ago
- This repository contains the dbt-glue adapter ☆116 Updated this week
- Demo for GitHub Universe 2022 ☆12 Updated 2 years ago