aws-samples/aws-ml-data-lake-workshop

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-samples/aws-ml-data-lake-workshop)

aws-samples / aws-ml-data-lake-workshop

As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and data format conversions for using AWS SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using…

☆63

Alternatives and similar repositories for aws-ml-data-lake-workshop

Users that are interested in aws-ml-data-lake-workshop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / aws-building-data-lake-reinvent-session-stg206
View on GitHub
Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…
☆26Apr 9, 2019Updated 7 years ago
aws-samples / amazon-serverless-datalake-workshop
View on GitHub
A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.
☆158Mar 24, 2020Updated 6 years ago
aws-samples / aws-dbs-refarch-datalake
View on GitHub
Reference Architectures for Datalakes on AWS
☆78May 13, 2020Updated 6 years ago
aws-samples / serverless-data-analytics
View on GitHub
CloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries
☆177Apr 28, 2020Updated 6 years ago
aws-samples / amazon-forecast-automation
View on GitHub
☆16Jul 29, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
awslabs / aws-sagemaker-emr-tutorial
View on GitHub
Supporting code, Dockerfile, and Jupyter notebook for an end to end tutorial on Amazon SageMaker and EMR.
☆27Jan 14, 2026Updated 6 months ago
aws-samples / cloud-builders-day-elastic-beanstalk-workshop
View on GitHub
The objective of Cloud Builders' Day repository is to provide do-it-yourself lab guides for several AWS services including but not limite…
☆11Aug 20, 2020Updated 5 years ago
aws-samples / aws-stepfunction-cicd-pipeline-example
View on GitHub
This repository contains an example of creating a pipeline to deploy an AWS Step Function State Machine.
☆21Jun 11, 2020Updated 6 years ago
aws-samples / aws-dbs-refarch-edw
View on GitHub
Repository for AWS DBS Reference Architectures - Enterprise Data Warehousing
☆35Jan 17, 2019Updated 7 years ago
aws-samples / data-purging-aws-data-lake
View on GitHub
☆22Jul 14, 2020Updated 6 years ago
aws-samples / aws-lakeformation-ml-transforms
View on GitHub
Workshop - Using AWS Lake Formation ML Transforms to cleanse the data in a data lake
☆14Dec 22, 2019Updated 6 years ago
aws-samples / aws-codepipeline-cross-region-continuous-deployment
View on GitHub
Cross region continuous deployment pipeline for developing high availability applications using AWS CodePipeline
☆12Nov 15, 2018Updated 7 years ago
aws-samples / aws-lambda-etl-ref-architecture
View on GitHub
This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…
☆24Jun 16, 2020Updated 6 years ago
aws-solutions / aws-data-lake-solution
View on GitHub
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically c…
☆399Jun 3, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aws-samples / amazon-sagemaker-predict-accessibility
View on GitHub
Build end-to-end Machine Learning pipeline to predict accessibility of playgrounds in NYC
☆15Jul 9, 2020Updated 6 years ago
aws-samples / aws-devsecops-workshop
View on GitHub
In this workshop we will build a pipeline for a sample WordPress site in a stack. We will explore how to validate, lint and test template…
☆18May 20, 2024Updated 2 years ago
aws-samples / aws-cicd-bluegreen
View on GitHub
Lets customers use Clouformation to create a VPC with ASG and two load balanced web servers. Customer will create a code pipeline to auto…
☆17Dec 2, 2019Updated 6 years ago
aws-samples / amazon-lex-bot-deploy
View on GitHub
The sample code provides a deploy function and an executable to easily deploy an Amazon Lex bot based on a Lex Schema file.
☆23Nov 2, 2023Updated 2 years ago
dylan-tong-aws / aws-serverless-ml-pipeline
View on GitHub
A serverless framework for continuous machine learning pipeline automation
☆14Aug 18, 2020Updated 5 years ago
aws-samples / accelerated-data-lake
View on GitHub
A packaged Data Lake solution, that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch
☆73Dec 27, 2020Updated 5 years ago
aws-samples / aws-dbs-refarch-rdbms
View on GitHub
Reference Architectures for Relational Databases on AWS
☆26Dec 1, 2020Updated 5 years ago
ran-isenberg / appsync-events-client
View on GitHub
AppSync Events frontend sample implementation
☆12Nov 16, 2024Updated last year
aws-samples / aws-etl-orchestrator
View on GitHub
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
☆345Mar 29, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
thomasnield / oreilly-probability-from-scratch
View on GitHub
☆18Jun 6, 2022Updated 4 years ago
aws-samples / aws-waf-classic-workshop
View on GitHub
A workshop about AWS WAF Classic and the WAF Security Automations Solution
☆20Feb 19, 2021Updated 5 years ago
aws-samples / amazon-sagemaker-cdk-examples
View on GitHub
amazon-sagemaker-cdk-examples uses AWS CDK to simplify common architectures in machine leaning operations using Sagemaker and other AWS s…
☆69Mar 28, 2024Updated 2 years ago
aws-samples / redshift-immersionday-labs
View on GitHub
This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.
☆53Mar 31, 2021Updated 5 years ago
aws-samples / aws-sagemaker-ml-blog-predictive-campaigns
View on GitHub
Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker
☆18Feb 10, 2019Updated 7 years ago
aws-samples / aws-alexa-workshop
View on GitHub
Learn how to build Alexa Skills with AWS Services.
☆26May 20, 2024Updated 2 years ago
aws-samples / data-engineering-for-aws-immersion-day
View on GitHub
Lab Instructions for Data Engineering Immersion Day
☆198Jan 26, 2026Updated 6 months ago
aws-samples / aws-transcribe-captioning-tools
View on GitHub
Convert AWS Transcribe output into multiple caption formats.
☆94Oct 12, 2020Updated 5 years ago
aws-samples / aws-cloudtrail-analyzer-workshop
View on GitHub
Workshop exercise materials for re:Invent 2017 - SID 341: Using AWS CloudTrail Logs for Scalable, Automated Anomaly Detection
☆54Apr 8, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pycaret / pycaret-deployment-aws
View on GitHub
Deploy Machine Learning Pipeline on AWS Fargate
☆13Dec 8, 2022Updated 3 years ago
aws-samples / aws-cross-account-serverless-microservices
View on GitHub
This repo contains a sample application composed of a web application supported by two serverless microservices. The microservices will b…
☆36Nov 10, 2023Updated 2 years ago
aws-solutions-library-samples / guidance-for-clickstream-analytics-on-aws
View on GitHub
Guidance for Clickstream Analytics on AWS source code
☆87Oct 13, 2025Updated 9 months ago
aws-samples / aws-sagemaker-build
View on GitHub
Creates a CloudFormation template that uses AWS StepFunctions to automate the building and training of Sagemaker custom models based on S…
☆164Jan 15, 2020Updated 6 years ago
aws-samples / amazon-efs-workshop
View on GitHub
This workshop will show solutions architects how to take advantage of a petabyte scale distributed file system for various application wo…
☆28Jul 7, 2020Updated 6 years ago
aws-samples / document-processing-pipeline-for-regulated-industries
View on GitHub
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata ser…
☆67Oct 25, 2021Updated 4 years ago
aws-samples / amazon-sagemaker-cloudformation-custom-resource
View on GitHub
Deploy Amazon SageMaker notebook using CloudFormation custom resource
☆18May 8, 2018Updated 8 years ago