nanlabs / aws-glue-etl-boilerplate
A complete example of an AWS Glue application that uses the Serverless Framework to deploy the infrastructure and DevContainers and/or Docker Compose to run the application locally with AWS Glue Libs, Spark, Jupyter Notebook, AWS CLI, among other tools. It provides jobs using Python Shell and PySpark.
☆18Updated 9 months ago
Alternatives and similar repositories for aws-glue-etl-boilerplate:
Users that are interested in aws-glue-etl-boilerplate are comparing it to the libraries listed below
- This repository contains different Frontend related resources like applications, examples, libraries, tools, etc.☆16Updated last week
- This repository contains different React components, hooks, apps and libraries that are used in different projects here at NaN Labs.☆23Updated 5 months ago
- This is a curated list of all the Open Source examples and projects we have at NaNLABS☆21Updated this week
- This repository contains different infrastructure components, CI/CD pipelines, automation tools among other resources that are used in di…☆46Updated last month
- Get started quickly with AWS infrastructure using a robust Terraform starter kit incorporating secure state management, VPC configuration…☆25Updated 4 months ago
- ☆30Updated last year
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆10Updated 2 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆66Updated 3 years ago
- This repository contains the dbt-glue adapter☆117Updated this week
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆106Updated last month
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆41Updated 2 years ago
- Constructs to deploy airflow via the aws cdk☆27Updated 4 years ago
- ☆32Updated last year
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆45Updated 5 months ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆32Updated 2 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆163Updated 4 months ago
- ☆14Updated 2 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆36Updated 2 months ago
- Set up a modern web app by running one command with different addons☆9Updated 4 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue