apache / beam-starter-python
Apache Beam starter repo for Python
☆19Updated last month
Alternatives and similar repositories for beam-starter-python:
Users that are interested in beam-starter-python are comparing it to the libraries listed below
- The go to demo for public and private dbt Learn☆76Updated 6 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆66Updated 10 months ago
- Data pipeline with dbt, Airflow, Great Expectations☆161Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- Package to assert rows in-line with dbt macros.☆66Updated last week
- ☆75Updated 5 months ago
- ☆51Updated 2 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆170Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆83Updated this week
- ☆134Updated 4 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆126Updated 2 weeks ago
- dbt Cloud Terraform Provider☆95Updated this week
- Data Quality Engine for BigQuery☆267Updated 8 months ago
- Pytest plugin for dbt core☆59Updated 2 months ago
- ☆198Updated last year
- ☆60Updated 2 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆74Updated 2 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆54Updated last week
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆22Updated 3 weeks ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev…☆22Updated 2 years ago
- Macros for generating dbt model data profiles☆86Updated 4 months ago
- BigQuery Column Lineage parser☆60Updated 7 months ago
- ☆16Updated 7 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆180Updated last week
- Faker for Snowflake!☆33Updated 2 years ago
- Snowflake-specific utility macros for dbt projects.☆108Updated 9 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆132Updated 8 months ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆158Updated last month
- dbt adapter for Teradata☆22Updated last week
- A style guide and linter for Looker's LookML data modeling language☆134Updated 2 weeks ago