aws-solutions / aws-data-lake-solution
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, search, share, and govern specific subsets of data across a business or with other external businesses.
☆401Updated 10 months ago
Alternatives and similar repositories for aws-data-lake-solution:
Users that are interested in aws-data-lake-solution are comparing it to the libraries listed below
- Enterprise-grade, production-hardened, serverless data lake on AWS☆448Updated 2 weeks ago
- A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.☆336Updated last year
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.☆158Updated 5 years ago
- A UI that simplifies testing with Amazon Kinesis Streams and Firehose. Create and save record templates, and easily send data to Amazon K…☆205Updated 6 months ago
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆198Updated last year
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch☆73Updated 4 years ago
- Amazon Redshift Advanced Monitoring☆272Updated 2 years ago
- CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries☆175Updated 4 years ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations.☆670Updated 11 months ago
- An open source development framework to help you build data workflows and modern data architecture on AWS.☆263Updated 2 weeks ago
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the…☆243Updated last month
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- The open source version of the AWS Step Functions Developer Guide. You can submit feedback & requests for changes by submitting issues in…☆159Updated last year
- ☆158Updated last year
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.☆573Updated this week
- Data Lake as Code, featuring ChEMBL and OpenTargets☆169Updated last year
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…☆63Updated 6 years ago
- Amazon Redshift Database Loader implemented in AWS Lambda☆596Updated 9 months ago
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc…☆85Updated last year
- A sample AWS Lambda function that accepts messages from an Amazon Kinesis Stream and transfers the messages to another data transport.☆289Updated 2 years ago
- ☆72Updated 10 months ago
- A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficie…☆240Updated last week
- ☆52Updated 7 years ago
- A reference architecture for handling batch processing workloads using Amazon ECS.☆150Updated 4 years ago
- Amazon Virtual Private Cloud—AWS Solution☆325Updated 6 months ago
- Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS☆291Updated last year
- Deal with the complexities of dealing with a long lived transaction across distributed components in your microservices architecture usin…☆137Updated last year
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆89Updated 6 months ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆75Updated 6 years ago
- Samples to help you get started with the Amazon Redshift Data API☆73Updated last year