awsdocs / aws-data-pipeline-developer-guide
The open source version of the AWS Data Pipeline documentation. To provide feedback & requests for changes, submit issues in this repository, or make proposed changes & submit a pull request.
☆16 · Updated last year
Related projects:
- Collection of CloudFormation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis… ☆26 · Updated 5 years ago
- This GitHub project provides a series of lab exercises that help users get started using the Redshift platform. ☆53 · Updated 3 years ago
- ☆52 · Updated 5 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time ☆34 · Updated 2 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amaz… ☆28 · Updated 5 years ago
- This guide has been archived. Please see https://github.com/awsdocs/amazon-s3-userguide for an open source version of the Amazon S3 docs.… ☆34 · Updated 3 years ago
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i… ☆61 · Updated 5 years ago
- ☆67 · Updated this week
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r… ☆61 · Updated last year
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as… ☆22 · Updated 5 years ago
- The open source version of the Amazon Kinesis Data Streams docs. You can submit feedback & requests for changes by submitting issues in t… ☆26 · Updated last year
- ☆53 · Updated 7 years ago
- Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using… ☆16 · Updated 4 years ago
- Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker ☆18 · Updated 5 years ago
- Samples and documentation for various advertising and marketing use cases on AWS. ☆35 · Updated last year
- A Java application that replays events that are stored in objects in Amazon S3 into an Amazon Kinesis stream as if they occurred in real t … ☆51 · Updated 7 months ago
- The open source version of the AWS Toolkit for Visual Studio Code user guide. You can submit feedback & requests for changes by submittin… ☆14 · Updated last year
- CloudFormation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama… ☆30 · Updated 4 years ago
- User guide for VM Import/Export ☆14 · Updated last year
- Replication utility for AWS Glue Data Catalog ☆73 · Updated last month
- A solution describing a data-processing design pattern for streaming data through Kinesis and Spark Streaming in real time. ☆38 · Updated 3 months ago
- ☆9 · Updated 10 months ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate ☆35 · Updated 2 months ago
- Reference Architectures for Data Lakes on AWS ☆76 · Updated 4 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, … ☆86 · Updated last year
- This solution helps you deploy ETL jobs on a data lake using CDK Pipelines. ☆66 · Updated 2 years ago
- This GitHub repo contains Aurora MySQL and PostgreSQL labs, an Aurora Serverless lab, and heterogeneous database migration labs with DMS. ☆30 · Updated last year
- AWS Workshop tutorial for building applications with Amazon AI Services ☆31 · Updated 2 years ago
- A packaged data lake solution that builds a highly functional data lake, with a data catalog queryable via Elasticsearch ☆72 · Updated 3 years ago
- Jupyter notebook that calls Rekognition, displays an image, and calls a local Neo4j DB to display a graph of relationships ☆27 · Updated 4 years ago