garystafford / emr-msk-serverless-demo
Amazon EMR Serverless and Amazon MSK Serverless Demo
☆13Updated 2 years ago
Alternatives and similar repositories for emr-msk-serverless-demo
Users who are interested in emr-msk-serverless-demo are comparing it to the repositories listed below
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- This is a collection of Amazon CDK projects to show how to directly ingest streaming data from Amazon Managed Service for Apache Kafka (M…☆12Updated 8 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 8 months ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆18Updated 2 weeks ago
- Utility functions for dbt projects running on Spark☆34Updated 3 months ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Updated 2 years ago
- ☆34Updated 2 years ago
- ☆12Updated 2 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you…☆11Updated this week
- Materials for the next course☆24Updated 2 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- Pytest plugin for dbt core☆60Updated 4 months ago
- ☆18Updated last year
- Spark data pipeline that processes movie ratings data.☆28Updated last month
- Glue VSCode devcontainer setup☆14Updated 2 years ago
- ☆60Updated 3 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆41Updated last year
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- dbt adapter for Teradata☆23Updated last month
- Sample Airflow DAGs☆62Updated 2 years ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and create an in-place updatable data lake on Amazon S3☆23Updated 8 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆164Updated 4 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- This repository contains the dbt-glue adapter☆120Updated last week
- ☆78Updated 7 months ago
- ☆13Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆36Updated 3 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- ☆30Updated last year