aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is gaining visibility into their data for feature engineering and converting data formats for use with Amazon SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…
☆61 Updated 6 years ago
Alternatives and similar repositories for aws-ml-data-lake-workshop:
Users who are interested in aws-ml-data-lake-workshop are comparing it to the repositories listed below.
- Collection of CloudFormation templates, Lambda scripts, and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis… ☆26 Updated 5 years ago
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments. ☆48 Updated last year
- AWS Workshop tutorial for building applications with Amazon AI Services ☆31 Updated 2 years ago
- ☆52 Updated 7 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆92 Updated 2 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real time ☆34 Updated 2 years ago
- A self-paced workshop designed to allow you to get hands-on with building a real-time data platform using serverless technologies such as… ☆22 Updated 6 years ago
- This solution helps you deploy ETL jobs on a data lake using CDK Pipelines. ☆67 Updated 2 years ago
- This GitHub project provides a series of lab exercises that help users get started using the Redshift platform. ☆53 Updated 3 years ago
- Reference Architectures for Data Lakes on AWS ☆79 Updated 4 years ago
- Open innovation with 60-minute cloud experiments on AWS ☆87 Updated 8 months ago
- ☆158 Updated 10 months ago
- Bring your own data Labs: Build a serverless data pipeline based on your own data ☆42 Updated last year
- Repository for AWS Glue Workshop ☆31 Updated 2 years ago
- ☆22 Updated 4 years ago
- AI_ML_Workshops ☆51 Updated 4 years ago
- Sample Jupyter Notebooks for Amazon Augmented AI (A2I) ☆70 Updated last year
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and QuickSight. ☆158 Updated 4 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amaz… ☆28 Updated 5 years ago
- A solution describing a data-processing design pattern for streaming data through Kinesis and Spark Streaming in real time. ☆38 Updated 7 months ago
- A packaged Data Lake solution that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch ☆73 Updated 4 years ago
- ☆87 Updated last year
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆49 Updated last year
- Build, train, and deploy your own custom container using the AWS Step Functions Data Science SDK ☆22 Updated 4 years ago
- A Java application that replays events that are stored in objects in Amazon S3 into an Amazon Kinesis stream as if they occurred in real t… ☆51 Updated 2 weeks ago
- ☆26 Updated 4 years ago
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc… ☆85 Updated last year
- Replication utility for AWS Glue Data Catalog ☆75 Updated 5 months ago