Automated data quality suggestions and analysis with Deequ on AWS Glue
☆91Dec 29, 2022Updated 3 years ago
Alternatives and similar repositories for amazon-deequ-glue
Users that are interested in amazon-deequ-glue are comparing it to the libraries listed below
Sorting:
- Python API for Deequ☆814Jan 21, 2026Updated last month
- Python API for Deequ☆41Nov 10, 2020Updated 5 years ago
- Replication utility for AWS Glue Data Catalog☆79Aug 8, 2024Updated last year
- Amazon Managed Service for Apache Flink Benchmarking Utility helps with capacity planning, integration testing, and benchmarking of Amazo…☆21Aug 30, 2023Updated 2 years ago
- ☆23Oct 3, 2024Updated last year
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Oct 17, 2023Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,588Feb 17, 2026Updated 2 weeks ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆52Oct 31, 2023Updated 2 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Sample demonstrating consuming Amazon Cognito Streams☆10Jun 15, 2020Updated 5 years ago
- ☆157Feb 29, 2024Updated 2 years ago
- A code-free AutoML pipeline with AutoGluon, Amazon SageMaker, and AWS Lambda.☆11Aug 5, 2021Updated 4 years ago
- ☆12Aug 9, 2024Updated last year
- An open source development framework to help you build data workflows and modern data architecture on AWS.☆271Feb 9, 2026Updated 3 weeks ago
- A tool to automate analytic platform evaluations. Barometer helps customers to get data points needed for service selection/service confi…☆19Jun 3, 2024Updated last year
- Usage examples for byte-genie API☆12Apr 27, 2024Updated last year
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 2 years ago
- ☆20May 21, 2024Updated last year
- Enterprise-grade, production-hardened, serverless data lake on AWS☆479Oct 1, 2025Updated 5 months ago
- A collection of examples built with AWS DataOps Development Kit (DDK)☆43Jan 7, 2026Updated last month
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics.☆31Aug 19, 2021Updated 4 years ago
- Some crazy experiments☆35Sep 3, 2025Updated 6 months ago
- ☆17Jul 21, 2025Updated 7 months ago
- AWS Glue code samples☆1,537Nov 5, 2025Updated 4 months ago
- Amazon SageMaker MLOps deployment pipeline for A/B Testing of machine learning models.☆45Jun 7, 2021Updated 4 years ago
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…☆201Jun 15, 2023Updated 2 years ago
- Amazon Kinesis Data Analytics Flink Starter Kit helps you with the development of Flink Application with Kinesis Stream as a source and A…☆47Aug 30, 2023Updated 2 years ago
- Dev fabric solution for SQL Server☆18Jun 9, 2021Updated 4 years ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆46May 10, 2024Updated last year
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆24Sep 6, 2023Updated 2 years ago
- ☆16Jan 31, 2022Updated 4 years ago
- Curated list of resources about Apache Airflow☆19Apr 7, 2021Updated 4 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆28Jul 23, 2020Updated 5 years ago
- Lab Instructions for Data Engineering Immersion Day☆197Jan 26, 2026Updated last month
- ☆89Nov 6, 2023Updated 2 years ago
- SageMaker specific extensions to TensorFlow.☆54Jul 23, 2024Updated last year
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24May 8, 2024Updated last year
- The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.☆605Feb 24, 2026Updated last week
- AWS Quick Start Team☆20Oct 3, 2024Updated last year