sodadata / soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
☆63Updated 2 years ago
Alternatives and similar repositories for soda-spark:
Users that are interested in soda-spark are comparing it to the libraries listed below
- A Python Library to support running data quality rules while the spark job is running⚡☆176Updated last week
- Library to convert DBT manifest metadata to Airflow tasks☆48Updated last year
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆146Updated this week
- A repository of sample code to accompany our blog post on Airflow and dbt.☆170Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆207Updated 3 weeks ago
- Delta Lake helper methods. No Spark dependency.☆23Updated 6 months ago
- The athena adapter plugin for dbt (https://getdbt.com)☆140Updated last year
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos☆59Updated 5 months ago
- Great Expectations Airflow operator☆161Updated this week
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.☆193Updated last month
- Delta Lake helper methods in PySpark☆322Updated 6 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆91Updated last year
- Delta lake and filesystem helper methods☆51Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆64Updated 5 months ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆43Updated 8 months ago
- This repository contains the dbt-glue adapter☆112Updated last week
- Spark style guide☆258Updated 5 months ago
- a dbt package to make auditing dbt runs easy.☆98Updated 3 months ago
- Apache Airflow integration for dbt☆400Updated 10 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- ☆49Updated 8 months ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- dbt adapter for Athena☆38Updated 9 months ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- A re-usable Snowflake helper package for dbt☆53Updated 5 years ago
- Collection of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆47Updated 2 years ago
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆119Updated this week
- ☆198Updated last year