great-expectations / great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
☆81 Updated last year
Alternatives and similar repositories for great_expectations_action
Users who are interested in great_expectations_action are comparing it to the libraries listed below.
- The easiest way to integrate Kedro and Great Expectations ☆54 Updated 2 years ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆39 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆39 Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort. ☆111 Updated 2 years ago
- Great Expectations Airflow operator ☆169 Updated last week
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 Updated 3 years ago
- Sample configuration to deploy a modern data platform. ☆89 Updated 3 years ago
- Templates for your Kedro projects. ☆80 Updated this week
- Code examples showing flow deployment to various types of infrastructure ☆111 Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆45 Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆56 Updated 5 months ago
- Deploy production-grade Metaflow cloud infrastructure on AWS ☆69 Updated 3 weeks ago
- A frictionless integrated platform for notebook ☆82 Updated 2 years ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command ☆72 Updated 6 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated last month
- Experimental MLflow plugin for Google Cloud Vertex AI ☆38 Updated 6 months ago
- Makes it simple to store test results and visualise them in a BI dashboard ☆52 Updated 2 months ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆186 Updated 2 years ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆30 Updated 2 years ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated 2 years ago
- pytest plugin to run the tests with support of pyspark ☆86 Updated 6 months ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations) ☆35 Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Collection of code snippets for blogs, conferences, and talks ☆24 Updated 3 years ago
- Black for Databricks notebooks ☆47 Updated 6 months ago
- A GitHub Action to lint, test, build-docs, package, and run your kedro pipelines. Supports any Python version you'll give it (that is als… ☆19 Updated last week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆229 Updated last month