aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and data format conversions for using AWS SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…
☆63 Updated 6 years ago
Alternatives and similar repositories for aws-ml-data-lake-workshop
Users interested in aws-ml-data-lake-workshop are comparing it to the repositories listed below.
- Reference Architectures for Datalakes on AWS ☆79 Updated 5 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time ☆34 Updated 2 years ago
- Collection of CloudFormation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis… ☆26 Updated 6 years ago
- A self-paced workshop designed to allow you to get hands-on with building a real-time data platform using serverless technologies such as… ☆22 Updated 6 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform. ☆53 Updated 4 years ago
- AWS Workshop tutorial for building applications with Amazon AI Services ☆31 Updated 3 years ago
- ☆52 Updated 7 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆93 Updated 2 years ago
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments. ☆49 Updated 2 years ago
- Open innovation with 60 minute cloud experiments on AWS ☆88 Updated last year
- A packaged Data Lake solution that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch ☆73 Updated 4 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amaz… ☆28 Updated 5 years ago
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines. ☆68 Updated 2 years ago
- ☆22 Updated 4 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and QuickSight. ☆158 Updated 5 years ago
- ☆88 Updated last year
- ☆158 Updated last year
- CloudFormation templates and scripts to set up the AWS services for the workshop, Athena & Redshift Spectrum queries ☆175 Updated 5 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics ☆64 Updated last year
- Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker ☆18 Updated 6 years ago
- Bring your own data Labs: Build a serverless data pipeline based on your own data ☆44 Updated 2 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r… ☆62 Updated 2 years ago
- Replication utility for AWS Glue Data Catalog ☆79 Updated 10 months ago
- A Java application that replays events that are stored in objects in Amazon S3 into an Amazon Kinesis stream as if they occurred in real t… ☆51 Updated 5 months ago
- This workshop demonstrates two methods of machine learning inference for global production using AWS Lambda and Amazon SageMaker ☆59 Updated 4 years ago
- AI_ML_Workshops ☆52 Updated 4 years ago
- A serverless framework for continuous machine learning pipeline automation ☆14 Updated 4 years ago
- Creates a CloudFormation template that uses AWS Step Functions to automate the building and training of SageMaker custom models based on S… ☆165 Updated 5 years ago
- Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using… ☆17 Updated 5 years ago
- ☆74 Updated last year