greatexpectationslabs / great_expectations_lambdaLinks
Minimal deployment of Great Expectations on lambda
β11Updated 5 years ago
Alternatives and similar repositories for great_expectations_lambda
Users that are interested in great_expectations_lambda are comparing it to the libraries listed below
Sorting:
- Tough and flexible tools for data analysis, transformation, validation and movement.β139Updated last year
- π Docker image for AWS Glue Spark/Pythonβ23Updated last year
- Automated data quality suggestions and analysis with Deequ on AWS Glueβ85Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formatsβ29Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframesβ63Updated 2 years ago
- Run dbt serverless in the Cloud (AWS)β42Updated 5 years ago
- A Getting Started Guide for developing and using Airflow Pluginsβ93Updated 6 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.β168Updated last year
- Unit and integration testing with PySpark can be tough to figure out, let's make that easier.β22Updated 9 years ago
- (project & tutorial) dag pipeline tests + ci/cd setupβ88Updated 4 years ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobsβ43Updated last year
- DBT Cloud Plugin for Airflowβ38Updated last year
- Example orchestration pipeline for Fivetran + dbt managed by Airflowβ22Updated 4 years ago
- Build DataOps platform with Apache Airflow and dbt on AWSβ55Updated 4 years ago
- dbt adapter for Athenaβ38Updated last year
- Great Expectations Airflow operatorβ165Updated this week
- Python API for Deequβ41Updated 4 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.β111Updated 2 years ago
- Visualize dependencies between Airflow DAGsβ49Updated 4 years ago
- Glue VSCode devcontainer setupβ14Updated 2 years ago
- Composable filesystem hooks and operators for Apache Airflow.β17Updated 3 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.β80Updated last year
- PySpark data-pipeline testing andΒ CICDβ28Updated 4 years ago
- Demo for GitHub Universe 2022β12Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.β48Updated last year
- This repository contains the dbt-glue adapterβ123Updated this week
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMRβ37Updated 3 months ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.β51Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Coreβ33Updated last year
- β34Updated 2 years ago