aws-solutions/aws-data-lake-solution

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-solutions/aws-data-lake-solution)

aws-solutions / aws-data-lake-solution

A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, search, share, and govern specific subsets of data across a business or with other external businesses.

☆399

Alternatives and similar repositories for aws-data-lake-solution

Users that are interested in aws-data-lake-solution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / aws-ml-data-lake-workshop
View on GitHub
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…
☆63Nov 28, 2018Updated 7 years ago
aws-solutions-library-samples / data-lakes-on-aws
View on GitHub
Enterprise-grade, production-hardened, serverless data lake on AWS
☆482Oct 1, 2025Updated 9 months ago
aws-samples / aws-building-data-lake-reinvent-session-stg206
View on GitHub
Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…
☆26Apr 9, 2019Updated 7 years ago
aws-samples / amazon-serverless-datalake-workshop
View on GitHub
A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.
☆158Mar 24, 2020Updated 6 years ago
aws-samples / data-lake-as-code
View on GitHub
Data Lake as Code, featuring ChEMBL and OpenTargets
☆173Nov 20, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
awslabs / cloudformation-ldaps-haproxy-template
View on GitHub
Configure an LDAPS Endpoint for Simple AD
☆14Aug 29, 2017Updated 8 years ago
aws-samples / accelerated-data-lake
View on GitHub
A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch
☆73Dec 27, 2020Updated 5 years ago
awslabs / predictive-segmentation-using-amazon-pinpoint-and-amazon-sagemaker
View on GitHub
This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom…
☆17Dec 17, 2020Updated 5 years ago
aws-samples / aws-glue-samples
View on GitHub
AWS Glue code samples
☆1,539Jun 8, 2026Updated last month
awslabs / aws-glue-libs
View on GitHub
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
☆702Jul 1, 2026Updated 3 weeks ago
aws-samples / aws-dbs-refarch-datalake
View on GitHub
Reference Architectures for Datalakes on AWS
☆78May 13, 2020Updated 6 years ago
awslabs / aws-s3snapshot
View on GitHub
S3 Snapshot script to run from command-line or scheduled in Lambda.
☆27May 7, 2019Updated 7 years ago
aws-solutions / automated-data-analytics-on-aws
View on GitHub
The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…
☆90Sep 25, 2024Updated last year
amazon-archives / harmonize-search-analyze
View on GitHub
Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize…
☆22Jun 4, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aws-samples / aws-analytics-reference-architecture
View on GitHub
☆157Feb 29, 2024Updated 2 years ago
aws-samples / aws-lambda-etl-ref-architecture
View on GitHub
This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…
☆24Jun 16, 2020Updated 6 years ago
aws-samples / data-purging-aws-data-lake
View on GitHub
☆22Jul 14, 2020Updated 6 years ago
aws-samples / aws-ddk-examples
View on GitHub
A collection of examples built with AWS DataOps Development Kit (DDK)
☆42Mar 23, 2026Updated 4 months ago
aws-solutions-library-samples / real-time-analytics-spark-streaming
View on GitHub
A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.
☆39Jun 11, 2024Updated 2 years ago
garystafford / athena-glue-quicksight-demo
View on GitHub
Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'
☆29Dec 22, 2020Updated 5 years ago
awslabs / aws-lambda-redshift-loader
View on GitHub
Amazon Redshift Database Loader implemented in AWS Lambda
☆595Jul 16, 2024Updated 2 years ago
amazon-archives / cdk-reinvent
View on GitHub
AWS re:Invent 2018 DEV372 "Infrastructure is Code" demo
☆15Sep 8, 2020Updated 5 years ago
aws-solutions-library-samples / Guidance-for-Authentication-with-Digital-Wallets-on-AWS
View on GitHub
dApp authentication with Amazon Cognito and Web3 proxy with Amazon API Gateway
☆15Aug 5, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aws-solutions / aws-centralized-logging
View on GitHub
☆250Mar 1, 2024Updated 2 years ago
aws-samples / aws-etl-orchestrator
View on GitHub
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
☆345Mar 29, 2024Updated 2 years ago
amazon-archives / aws-lambda-serverless-escalator
View on GitHub
A page escalation system using AWS Lambda, Step Functions, and API Gateway.
☆26May 12, 2018Updated 8 years ago
awslabs / amazon-s3-step-functions-ingestion-orchestration
View on GitHub
Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…
☆29Jul 24, 2019Updated 7 years ago
awslabs / amazon-redshift-monitoring
View on GitHub
Amazon Redshift Advanced Monitoring
☆269Oct 28, 2025Updated 8 months ago
aws-samples / data-engineering-for-aws-immersion-day
View on GitHub
Lab Instructions for Data Engineering Immersion Day
☆198Jan 26, 2026Updated 5 months ago
awslabs / automated-account-configuration
View on GitHub
The Automated Account Configuration is a sample solution to enable operational scale for AWS customers by automating repeatable steps req…
☆14Jul 26, 2023Updated 2 years ago
awslabs / aws-athena-query-federation
View on GitHub
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
☆611Updated this week
aws-samples / serverless-data-analytics
View on GitHub
CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries
☆177Apr 28, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aws-samples / aws-dbs-refarch-edw
View on GitHub
Repository for AWS DBS Reference Architectures - Enterprise Data Warehousing
☆35Jan 17, 2019Updated 7 years ago
aws-samples / cloudscape-design-system-workshop
View on GitHub
Source repository for the "Build a cloud experience with Cloudscape, an open-source design system" workshop.
☆16Jan 22, 2025Updated last year
awslabs / amazon-sagemaker-workshop
View on GitHub
Amazon SageMaker workshops: Introduction, TensorFlow in SageMaker, and more
☆387Jan 14, 2026Updated 6 months ago
aws-samples / aws-waf-embargoed-countries-ofac
View on GitHub
The article provides a push-button solution to protect your infrastructure against incoming traffic from embargoed countries as defined b…
☆16Jun 1, 2019Updated 7 years ago
aws-solutions / generative-ai-application-builder-on-aws
View on GitHub
Generative AI Application Builder on AWS facilitates the development, rapid experimentation, and deployment of generative artificial inte…
☆350Jul 16, 2026Updated last week
aws-samples / aws-saas-factory-bootcamp
View on GitHub
SaaS on AWS Bootcamp - Building SaaS Solutions on AWS
☆416May 23, 2026Updated 2 months ago
aws / aws-sdk-pandas
View on GitHub
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD…
☆4,117Updated this week