HsiehShuJeng / cdk-emrserverless-with-delta-lake
This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you can also launch an EMR notebook via a cluster template to check the outcome from the EMR Serverless application.
☆11Updated 2 months ago
Alternatives and similar repositories for cdk-emrserverless-with-delta-lake
Users that are interested in cdk-emrserverless-with-delta-lake are comparing it to the libraries listed below
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆52Updated 2 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Updated 8 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆168Updated last year
- Data Lake as Code, featuring ChEMBL and OpenTargets☆173Updated 2 years ago
- ☆34Updated 3 years ago
- Samples to help you get started with the Amazon Redshift Data API☆71Updated 2 years ago
- ☆72Updated last year
- ☆32Updated last year
- ☆18Updated last year
- Spark runtime on AWS Lambda☆113Updated 4 months ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆31Updated 2 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆90Updated 3 years ago
- ☆29Updated last year
- Build DataOps platform with Apache Airflow and dbt on AWS☆59Updated 4 years ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati…☆118Updated last month
- Replication utility for AWS Glue Data Catalog☆79Updated last year
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines.☆68Updated 3 years ago
- ☆54Updated 5 months ago
- A solution that automatically configures the AWS services necessary to easily capture, store, process, and deliver streaming data. This …☆94Updated 11 months ago
- ☆27Updated 5 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆28Updated 5 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆99Updated 3 years ago
- An open-source framework that simplifies implementation of data solutions.☆146Updated last month
- Reference Architectures for Datalakes on AWS☆78Updated 5 years ago
- This repository contains the dbt-glue adapter☆139Updated 2 weeks ago
- Lab Instructions for Data Engineering Immersion Day☆196Updated 11 months ago
- ☆32Updated last year
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆67Updated last week
- ☆74Updated 2 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 3 years ago