aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and converting data formats for use with Amazon SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…
☆62Updated 6 years ago
Alternatives and similar repositories for aws-ml-data-lake-workshop:
Users who are interested in aws-ml-data-lake-workshop are comparing it to the repositories listed below.
- Collection of CloudFormation templates, Lambda scripts and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis…☆25Updated 5 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments.☆49Updated 2 years ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated 11 months ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- A packaged Data Lake solution that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch☆72Updated 4 years ago
- AWS Workshop tutorial for building applications with Amazon AI Services☆31Updated 3 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amaz…☆28Updated 5 years ago
- Sample Jupyter Notebooks for Amazon Augmented AI (A2I)☆71Updated last year
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆94Updated 2 years ago
- AI_ML_Workshops☆52Updated 4 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and QuickSight.☆158Updated 5 years ago
- A Java application that replays events that are stored in objects in Amazon S3 into an Amazon Kinesis stream as if they occurred in real t…☆51Updated 2 months ago
- Bring your own data Labs: Build a serverless data pipeline based on your own data☆44Updated last year
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines.☆67Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆50Updated last year
- Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using…☆17Updated 5 years ago
- AWS Solution with a CloudFormation template used to deploy a Kinesis Analytics application, optional web server for generating web usage…☆68Updated last year
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆89Updated 6 months ago
- CloudFormation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Updated 5 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆28Updated 4 years ago