AWS Glue code samples
☆1,534Jun 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for aws-glue-samples
Users that are interested in aws-glue-samples are comparing it to the libraries listed below.
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations.☆701Apr 24, 2026Updated 2 months ago
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆201Jun 15, 2023Updated 3 years ago
- A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.☆345Mar 29, 2024Updated 2 years ago
- pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD…☆4,110Jun 25, 2026Updated last week
- Amazon Redshift Utils contains utilities, scripts, and views that are useful in a Redshift environment☆2,808Sep 3, 2025Updated 9 months ago
- ☆71May 8, 2026Updated last month
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆230May 18, 2026Updated last month
- Enterprise-grade, production-hardened, serverless data lake on AWS☆480Oct 1, 2025Updated 9 months ago
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.☆611Jun 25, 2026Updated last week
- CloudFormation templates and scripts to set up the AWS services for the workshop, Athena & Redshift Spectrum queries☆177Apr 28, 2020Updated 6 years ago
- A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and QuickSight.☆158Mar 24, 2020Updated 6 years ago
- A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically c…☆398Jun 3, 2024Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,625Jun 25, 2026Updated last week
- ☆893Jul 15, 2022Updated 3 years ago
- Amazon Redshift Advanced Monitoring☆270Oct 28, 2025Updated 8 months ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆93Dec 29, 2022Updated 3 years ago
- A collection of example UDFs for Amazon Redshift.☆244Jun 11, 2026Updated 3 weeks ago
- Amazon Redshift Database Loader implemented in AWS Lambda☆595Jul 16, 2024Updated last year
- Glue scripts for converting AWS Service Logs for use in Athena☆139Feb 1, 2024Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆186Jan 26, 2022Updated 4 years ago
- Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.☆10,963Jun 24, 2026Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless.☆170Jun 8, 2026Updated 3 weeks ago
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…☆63Nov 28, 2018Updated 7 years ago
- The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation …☆9,560Jun 23, 2026Updated last week
- ☆157Feb 29, 2024Updated 2 years ago
- ☆34Mar 20, 2024Updated 2 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Jun 15, 2023Updated 3 years ago
- Code and walkthrough labs to set up serverless applications for Wild Rydes workshops☆4,265Jul 29, 2024Updated last year
- Redshift Python Connector. It supports Python Database API Specification v2.0.☆219Jun 10, 2026Updated 3 weeks ago
- ☆73Nov 10, 2023Updated 2 years ago
- Example projects using the AWS CDK☆5,608Jun 22, 2026Updated last week
- Python Serverless Microframework for AWS☆11,062Jun 25, 2026Updated last week
- Python API for Deequ☆822Jun 11, 2026Updated 3 weeks ago
- ☆17May 16, 2020Updated 6 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Mar 31, 2021Updated 5 years ago
- An open-source framework that simplifies implementation of data solutions.☆146Dec 2, 2025Updated 7 months ago
- Repository for code examples from my YouTube channel and Medium articles working with data in Python on AWS☆29Feb 5, 2024Updated 2 years ago
- CLI tool to build, test, debug, and deploy Serverless applications using AWS SAM☆6,734Updated this week