MrPowers / chispa
PySpark test helper methods with beautiful error messages
★686 · Updated 3 weeks ago
Alternatives and similar repositories for chispa:
Users who are interested in chispa are comparing it to the libraries listed below.
- pyspark methods to enhance developer productivity ★669 · Updated 2 months ago
- Python API for Deequ ★766 · Updated last month
- Delta Lake helper methods in PySpark ★322 · Updated 8 months ago
- A Python Library to support running data quality rules while the spark job is running ★187 · Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ★215 · Updated last week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ★245 · Updated 3 months ago
- Spark style guide ★258 · Updated 7 months ago
- Port(ish) of Great Expectations to dbt test macros ★1,162 · Updated 4 months ago
- Apache Airflow integration for dbt ★403 · Updated 11 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks ★427 · Updated 3 months ago
- Template for a data contract used in a data mesh. ★472 · Updated last year
- Data pipeline with dbt, Airflow, Great Expectations ★162 · Updated 3 years ago
- Delta Lake examples ★224 · Updated 7 months ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe… ★439 · Updated 3 weeks ago
- The athena adapter plugin for dbt (https://getdbt.com) ★249 · Updated 3 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ★645 · Updated 3 weeks ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow. ★197 · Updated this week
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts ★358 · Updated this week
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins. ★249 · Updated 3 years ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ★368 · Updated this week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ★168 · Updated last year
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects. ★433 · Updated 2 months ago
- Great Expectations Airflow operator ★163 · Updated last week
- Collection of dbt Tips and Tricks ★386 · Updated 2 years ago
- Turning PySpark Into a Universal DataFrame API ★390 · Updated last week
- Macros that generate dbt code ★560 · Updated last month
- dbt macros to stage external sources ★334 · Updated last month
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer… ★538 · Updated last month
- Useful macros when performing data audits ★356 · Updated 3 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt. ★172 · Updated last year