aws-solutions-library-samples / aws-insurancelake-etl
This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, AWS Glue for data transformation, and AWS CDK Pipelines. It was originally based on the AWS blog post "Deploy data lake ETL jobs using CDK Pipelines" and complements the InsuranceLake Infrastructure project.
☆27 Updated 2 months ago
Alternatives and similar repositories for aws-insurancelake-etl
Users who are interested in aws-insurancelake-etl are comparing it to the libraries listed below:
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆22 Updated last year
- ☆54 Updated last year
- This Guidance demonstrates how you can extend the data governance capabilities of Amazon DataZone to other Java Database Connectivity (JD… ☆13 Updated 8 months ago
- ☆32 Updated last year
- This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone. ☆9 Updated last month
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines. ☆68 Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆51 Updated last year
- ☆31 Updated last year
- ☆26 Updated 10 months ago
- aws-solutions-library-samples / guidance-for-preparing-and-validating-records-for-entity-resolution-on-aws: This Guidance demonstrates how to prepare and validate Personally Identifiable Information (PII) data, including physical address, phone,… ☆9 Updated 8 months ago
- ☆73 Updated last year
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, … ☆15 Updated 2 months ago
- In this repository, we show how to get started with data lineage on AWS using OpenLineage. This is an AWS Cloud Development Kit project (… ☆13 Updated 11 months ago
- ☆13 Updated last year
- Integrate SageMaker Data Wrangler into your MLOps workflows with Amazon SageMaker Pipelines, AWS Step Functions, and Amazon Managed Workf… ☆18 Updated 2 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆93 Updated 2 years ago
- MLOps Pipeline Using SageMaker & CDK, where models are from SageMaker built-in algorithms. ☆27 Updated 2 months ago
- ☆18 Updated last year
- Collection of CloudFormation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis… ☆26 Updated 6 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28 Updated 10 months ago
- ☆9 Updated 8 months ago
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract, Transform, Load (ETL) workflow with AWS La… ☆24 Updated 5 years ago
- ☆20 Updated last week
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆23 Updated 9 months ago
- Question Answering application with Large Language Models (LLMs) and Amazon PostgreSQL using pgvector ☆16 Updated 6 months ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS ☆32 Updated 4 months ago
- dbt / Amazon Redshift Demonstration Project ☆34 Updated 2 years ago
- This GitHub repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs. ☆31 Updated 2 years ago
- Code companion to AWS Compute Series on building a serverless backend for a streaming application. Questions? Contact @jbesw. ☆45 Updated 3 months ago
- A collection of examples built with AWS DataOps Development Kit (DDK) ☆42 Updated last week