aws-solutions / aws-data-lake-solutionLinks
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, search, share, and govern specific subsets of data across a business or with other external businesses.
☆401Updated last year
Alternatives and similar repositories for aws-data-lake-solution
Users that are interested in aws-data-lake-solution are comparing it to the libraries listed below
Sorting:
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆198Updated 2 years ago
- A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.☆340Updated last year
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.☆158Updated 5 years ago
- A UI that simplifies testing with Amazon Kinesis Streams and Firehose. Create and save record templates, and easily send data to Amazon K…☆207Updated 8 months ago
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch☆73Updated 4 years ago
- CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries☆175Updated 5 years ago
- Amazon Redshift Advanced Monitoring☆272Updated 2 years ago
- Enterprise-grade, production-hardened, serverless data lake on AWS☆454Updated 2 months ago
- Reference Architectures for Datalakes on AWS☆79Updated 5 years ago
- ☆73Updated last year
- ☆158Updated last year
- ☆52Updated 7 years ago
- Data Lake as Code, featuring ChEMBL and OpenTargets☆170Updated last year
- kinesis-kafka-connector is connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose.☆155Updated last year
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations.☆673Updated last year
- An open source development framework to help you build data workflows and modern data architecture on AWS.☆267Updated 2 months ago
- A sample AWS Lambda function that accepts messages from an Amazon Kinesis Stream and transfers the messages to another data transport.☆289Updated 2 years ago
- Creates a CloudFormation template that uses AWS StepFunctions to automate the building and training of Sagemaker custom models based on S…☆165Updated 5 years ago
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the…☆242Updated last week
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc…☆86Updated last year
- Sample code for dynamically managing RDS/RDBMS connections across a fleet of Lambda functions☆236Updated 6 years ago
- Samples and documentation for using the Amazon Neptune graph database service☆358Updated this week
- A set of sample database and associated items to allow customers to among other things follow along with published database migration rec…☆182Updated last year
- Lab Instructions for Data Engineering Immersion Day☆190Updated 4 months ago
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.☆580Updated this week
- AWS Lambda function to forward Stream data to Kinesis Firehose☆278Updated last year
- Sample CloudFormation templates and architecture for AWS Service Catalog☆436Updated last year
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…☆63Updated 6 years ago
- Tools and utilities to enable loading data and building graph applications with Amazon Neptune.☆309Updated this week