aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and data format conversions for using AWS SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…
☆63Updated 6 years ago
Alternatives and similar repositories for aws-ml-data-lake-workshop:
Users that are interested in aws-ml-data-lake-workshop are comparing it to the libraries listed below
- ☆52Updated 7 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…☆25Updated 6 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch☆73Updated 4 years ago
- AWS Workshop tutorial for building applications with Amazon AI Services☆31Updated 3 years ago
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- AI_ML_Workshops☆52Updated 4 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.☆157Updated 5 years ago
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines.☆68Updated 2 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆94Updated 2 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated last year
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments.☆49Updated 2 years ago
- ☆74Updated last year
- CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries☆175Updated 4 years ago
- ☆158Updated last year
- ☆88Updated last year
- ☆22Updated 4 years ago
- Bring your own data Labs: Build a serverless data pipeline based on your own data☆44Updated last year
- ☆27Updated 4 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆28Updated 4 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆50Updated last year
- A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.☆38Updated 10 months ago
- A solutions that automatically configures the AWS services necessary to easily capture, store, process, and deliver streaming data. This …☆93Updated 2 months ago
- A serverless framework for continuous machine learning pipeline automation☆14Updated 4 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated last year
- Repository for AWS Glue Workshop☆31Updated 2 years ago
- Sample Jupyter Notebooks for Amazon Augmented AI (A2I)☆71Updated last year