aws-solutions / aws-data-lake-solution
A deployable reference implementation that addresses pain points around conceptualizing data lake architectures. It automatically configures the core AWS services needed to tag, search, share, and govern specific subsets of data across a business or with external businesses.
☆401 Updated last year
Alternatives and similar repositories for aws-data-lake-solution
Users who are interested in aws-data-lake-solution are comparing it to the repositories listed below.
- A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda. ☆344 Updated last year
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight. ☆158 Updated 5 years ago
- A UI that simplifies testing with Amazon Kinesis Streams and Firehose. Create and save record templates, and easily send data to Amazon K… ☆211 Updated last year
- CloudFormation templates and scripts to set up the AWS services for the workshop, Athena & Redshift Spectrum queries ☆176 Updated 5 years ago
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki… ☆201 Updated 2 years ago
- Enterprise-grade, production-hardened, serverless data lake on AWS ☆476 Updated 4 months ago
- Reference Architectures for Datalakes on AWS ☆78 Updated 5 years ago
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch ☆73 Updated 5 years ago
- ☆157 Updated last year
- Data Lake as Code, featuring ChEMBL and OpenTargets ☆173 Updated 2 years ago
- Amazon Redshift Advanced Monitoring ☆273 Updated 3 months ago
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the… ☆245 Updated 2 weeks ago
- ☆72 Updated last year
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc… ☆86 Updated 2 years ago
- Creates a CloudFormation template that uses AWS StepFunctions to automate the building and training of Sagemaker custom models based on S… ☆165 Updated 6 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform. ☆53 Updated 4 years ago
- ☆52 Updated 8 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆99 Updated 3 years ago
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code. ☆603 Updated this week
- An open source development framework to help you build data workflows and modern data architecture on AWS. ☆271 Updated 9 months ago
- A solution that automatically configures the AWS services necessary to easily capture, store, process, and deliver streaming data. This … ☆94 Updated last year
- ☆74 Updated 2 years ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations. ☆696 Updated 3 weeks ago
- Glue scripts for converting AWS Service Logs for use in Athena ☆140 Updated 2 years ago
- This repository hosts sample pipelines ☆470 Updated 5 years ago
- A reference architecture for handling batch processing workloads using Amazon ECS. ☆151 Updated 5 years ago
- Samples and documentation for using the Amazon Neptune graph database service ☆368 Updated this week
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆77 Updated 7 years ago
- Continuously monitors a set of log files and sends new data to the Amazon Kinesis Stream and Amazon Kinesis Firehose in near-real-time. ☆373 Updated last month
- Can you set up a data warehouse and create a dashboard in under 60 minutes? In this workshop, we show you how with Amazon Redshift, a ful… ☆29 Updated 6 years ago