aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and data format conversions for using AWS SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…
☆62Updated 6 years ago
Alternatives and similar repositories for aws-ml-data-lake-workshop:
Users that are interested in aws-ml-data-lake-workshop are comparing it to the libraries listed below
- Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…☆25Updated 5 years ago
- ☆52Updated 7 years ago
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments.☆49Updated 2 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆94Updated 2 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- AWS Workshop tutorial for building applications with Amazon AI Services☆31Updated 3 years ago
- Sample Jupyter Notebooks for Amazon Augmented AI (A2I)☆71Updated last year
- ☆22Updated 4 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.☆158Updated 5 years ago
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated 11 months ago
- ☆158Updated last year
- CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries☆175Updated 4 years ago
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines.☆67Updated 2 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch☆72Updated 4 years ago
- Bring your own data Labs: Build a serverless data pipeline based on your own data☆44Updated last year
- A serverless framework for continuous machine learning pipeline automation☆14Updated 4 years ago
- Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy.☆104Updated 2 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- AI_ML_Workshops☆52Updated 4 years ago
- ☆73Updated last year
- This workshop demonstrates two methods of machine learning inference for global production using AWS Lambda and Amazon SageMaker☆57Updated 4 years ago
- Sample code and datasets for Amazon Fraud Detector☆78Updated last year
- ☆88Updated last year
- ☆27Updated 4 years ago
- A Java application that replays events that are stored in objects in Amazon S3 into a Amazon Kinesis stream as if they occurred in real t…☆51Updated 3 months ago
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆198Updated last year
- Creates a CloudFormation template that uses AWS StepFunctions to automate the building and training of Sagemaker custom models based on S…☆165Updated 5 years ago