This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.
☆52 · Oct 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for emr-studio-notebook-examples
Users who are interested in emr-studio-notebook-examples are comparing it to the libraries listed below
- This repository provides the resources required for the Amazon Redshift Streaming workshop ☆13 · Jul 12, 2023 · Updated 2 years ago
- This Guidance, with the sample code, can be used to deploy a carbon data lake to the AWS Cloud using an AWS Cloud Development Kit (AWS CD… ☆23 · Jan 8, 2025 · Updated last year
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆17 · Apr 27, 2025 · Updated 10 months ago
- Amazon Redshift Serverless RSQL ETL Framework ☆10 · Apr 1, 2025 · Updated 11 months ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. ☆100 · Aug 12, 2022 · Updated 3 years ago
- Optimizing your cost with Rightsizing Recommendations - This tool will print the Rightsizing Recommendations from your AWS Account / Orga… ☆13 · Aug 10, 2023 · Updated 2 years ago
- The proposed solution shows an approach to unify and centralize logs across different compute platforms like EC2, ECS, EKS and Lambda wi… ☆14 · Oct 17, 2023 · Updated 2 years ago
- Describes the concepts of lambda architecture and the actual deployment process with an example of building a serverless business intelli… ☆15 · Jun 10, 2025 · Updated 8 months ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆169 · Jan 8, 2025 · Updated last year
- Replication utility for AWS Glue Data Catalog ☆79 · Aug 8, 2024 · Updated last year
- Python script to automatically sync new instances via AWS CodeDeploy APIs ☆16 · Jan 14, 2026 · Updated last month
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric… ☆14 · Nov 21, 2018 · Updated 7 years ago
- This Guidance demonstrates how you can extend the data governance capabilities of Amazon DataZone to other Java Database Connectivity (JD… ☆13 · Oct 19, 2024 · Updated last year
- Sample datasets and code for operationalizing Amazon Fraud Detector using SageMaker DataWrangler, Feature Store, and Pipelines. ☆18 · Dec 1, 2022 · Updated 3 years ago
- EMR Hudi Workshop content ☆12 · Dec 10, 2021 · Updated 4 years ago
- End to end example of a Retail Agent implemented with agents for Amazon Bedrock ☆38 · Mar 15, 2024 · Updated last year
- This solution helps you deploy ETL jobs on data lake using CDK Pipelines. ☆69 · Aug 9, 2022 · Updated 3 years ago
- Learn more at https://aws.amazon.com/blogs/compute/using-aws-x-ray-tracing-with-amazon-eventbridge/ ☆15 · Aug 17, 2022 · Updated 3 years ago
- Learn to build custom prompts and tools for LangChain agents ☆39 · Feb 8, 2024 · Updated 2 years ago
- A tool to backup and restore AWS Connect, with some useful other utilities too ☆21 · Aug 25, 2022 · Updated 3 years ago
- Host and AutoScale Gitlab CI/CD runners on AWS ☆17 · Jan 13, 2021 · Updated 5 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆24 · Sep 6, 2023 · Updated 2 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace ☆21 · Feb 12, 2025 · Updated last year
- Examples for AWS-related blog posts ☆20 · Feb 21, 2026 · Updated last week
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆91 · Dec 29, 2022 · Updated 3 years ago
- A platform to help security researchers develop and test machine learning-based security services based on time-series data, with the abi… ☆18 · Apr 9, 2019 · Updated 6 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3 ☆28 · Jul 23, 2020 · Updated 5 years ago