liangruibupt / glue-streaming-etl-demo
AWS serverless ETL and streaming demo
☆18Updated 3 years ago
Alternatives and similar repositories for glue-streaming-etl-demo
Users that are interested in glue-streaming-etl-demo are comparing it to the libraries listed below
- In this workshop you will launch an Amazon Redshift cluster in your AWS account and load about 100 GB of sample data using the TPC-H dataset. You will…☆24Updated 6 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆98Updated 3 years ago
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics.☆30Updated 4 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆52Updated last year
- Terraform module to create AWS Redshift resources 🇺🇦☆87Updated 6 months ago
- ☆27Updated 4 years ago
- ☆32Updated 3 years ago
- ☆73Updated last year
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆74Updated 2 weeks ago
- Workshop and lab content for Amazon Aurora MySQL compatible databases. This code will contain a series of templates, instructional guides…☆76Updated last year
- ☆52Updated 8 years ago
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆42Updated 2 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated 2 years ago
- A tool to learn a JSON schema from a collection of documents and generate a CREATE TABLE statement for Redshift☆22Updated 3 months ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆33Updated 8 months ago
- Kubeflow workshop on EKS. Mainly focuses on AWS integration examples. Please check the Kubeflow website http://kubeflow.org for other exampl…☆99Updated 4 years ago
- A serverless data lake project and framework based on AWS S3, Glue, Athena, MWAA and QuickSight. With a series of best practices, it guides y…☆16Updated 2 years ago
- Replication utility for AWS Glue Data Catalog☆80Updated last year
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines.☆68Updated 3 years ago
- Data pipeline for CDC data from a MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using AWS Database Migration Service (DMS).☆33Updated 8 months ago
- A Data Platform built for AWS, powered by Kubernetes.☆147Updated 2 years ago
- Build your own log analytics platform on OpenSearch in 20 minutes☆130Updated 3 weeks ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati…☆116Updated 3 months ago
- ☆158Updated last year
- AWS Quick Start Team☆60Updated last year
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆66Updated 4 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆168Updated 9 months ago
- Simple script to break down AWS billing by "Project" tag☆36Updated 7 years ago