A deployable reference implementation that addresses pain points around conceptualizing data lake architectures. It automatically configures the core AWS services needed to tag, search, share, and govern specific subsets of data across a business or with external businesses.
☆401 · Jun 3, 2024 · Updated last year
Alternatives and similar repositories for aws-data-lake-solution
Users that are interested in aws-data-lake-solution are comparing it to the libraries listed below.
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i… ☆63 · Nov 28, 2018 · Updated 7 years ago
- Enterprise-grade, production-hardened, serverless data lake on AWS ☆478 · Oct 1, 2025 · Updated 5 months ago
- Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis… ☆26 · Apr 9, 2019 · Updated 6 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight. ☆158 · Mar 24, 2020 · Updated 5 years ago
- Data Lake as Code, featuring ChEMBL and OpenTargets ☆173 · Nov 20, 2023 · Updated 2 years ago
- Configure an LDAPS Endpoint for Simple AD ☆14 · Aug 29, 2017 · Updated 8 years ago
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch ☆73 · Dec 27, 2020 · Updated 5 years ago
- This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom… ☆17 · Dec 17, 2020 · Updated 5 years ago
- AWS Glue code samples ☆1,535 · Nov 5, 2025 · Updated 4 months ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations. ☆698 · Jan 13, 2026 · Updated 2 months ago
- Reference Architectures for Datalakes on AWS ☆78 · May 13, 2020 · Updated 5 years ago
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data… ☆91 · Sep 25, 2024 · Updated last year
- S3 Snapshot script to run from command-line or scheduled in Lambda. ☆28 · May 7, 2019 · Updated 6 years ago
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize… ☆22 · Jun 4, 2019 · Updated 6 years ago
- ☆157 · Feb 29, 2024 · Updated 2 years ago
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract, Transform, Load (ETL) workflow with AWS La… ☆24 · Jun 16, 2020 · Updated 5 years ago
- A collection of examples built with AWS DataOps Development Kit (DDK) ☆43 · Jan 7, 2026 · Updated 2 months ago
- Generative AI Application Builder on AWS facilitates the development, rapid experimentation, and deployment of generative artificial inte… ☆333 · Updated this week
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight' ☆29 · Dec 22, 2020 · Updated 5 years ago
- Amazon Redshift Database Loader implemented in AWS Lambda ☆595 · Jul 16, 2024 · Updated last year
- A solution describing a data-processing design pattern for streaming data through Kinesis and Spark Streaming in real time. ☆39 · Jun 11, 2024 · Updated last year
- AWS re:Invent 2018 DEV372 "Infrastructure is Code" demo ☆15 · Sep 8, 2020 · Updated 5 years ago
- ☆22 · Jul 14, 2020 · Updated 5 years ago
- A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda. ☆345 · Mar 29, 2024 · Updated last year
- ☆250 · Mar 1, 2024 · Updated 2 years ago
- Amazon Redshift Advanced Monitoring ☆270 · Oct 28, 2025 · Updated 4 months ago
- A page escalation system using AWS Lambda, Step Functions, and API Gateway. ☆26 · May 12, 2018 · Updated 7 years ago
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code. ☆607 · Updated this week
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amaz… ☆29 · Jul 24, 2019 · Updated 6 years ago
- Lab Instructions for Data Engineering Immersion Day ☆197 · Jan 26, 2026 · Updated last month
- The Automated Account Configuration is a sample solution to enable operational scale for AWS customers by automating repeatable steps req… ☆14 · Jul 26, 2023 · Updated 2 years ago
- Source repository for the "Build a cloud experience with Cloudscape, an open-source design system" workshop. ☆16 · Jan 22, 2025 · Updated last year
- Repository for AWS DBS Reference Architectures - Enterprise Data Warehousing ☆35 · Jan 17, 2019 · Updated 7 years ago
- Amazon SageMaker workshops: Introduction, TensorFlow in SageMaker, and more ☆390 · Jan 14, 2026 · Updated 2 months ago
- CloudFormation templates and scripts to set up the AWS services for the workshop, Athena & Redshift Spectrum queries ☆176 · Apr 28, 2020 · Updated 5 years ago
- The article provides a push-button solution to protect your infrastructure against incoming traffic from embargoed countries as defined b… ☆15 · Jun 1, 2019 · Updated 6 years ago
- SaaS on AWS Bootcamp - Building SaaS Solutions on AWS ☆418 · Mar 1, 2024 · Updated 2 years ago
- An automated reference implementation that assists with setting up cross-account roles for easy federation of users from one AWS master a… ☆55 · Mar 28, 2018 · Updated 7 years ago
- This application (in the form of a Lambda function) will publish CloudWatch metrics based on API usage. It listens to a CloudWatch Log St… ☆31 · Jan 21, 2022 · Updated 4 years ago