apache / beam-starter-python
Apache Beam starter repo for Python
☆18Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for beam-starter-python
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆63Updated 6 months ago
- ☆64Updated 3 weeks ago
- Define, govern, and model event data for warehouse-first product analytics.☆81Updated 4 months ago
- ☆20Updated 2 weeks ago
- ☆128Updated last month
- ☆126Updated 6 months ago
- A bunch of hacks developed around dbt☆48Updated 4 years ago
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 3 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆25Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- Rules based grant management for Snowflake☆40Updated 5 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt.☆166Updated last year
- Delta Lake Documentation☆46Updated 4 months ago
- ☆20Updated 3 years ago
- A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt☆112Updated 2 years ago
- Run your dbt models efficiently using dbt_smart_run☆12Updated last year
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆46Updated 2 weeks ago
- The go to demo for public and private dbt Learn☆69Updated 2 months ago
- a dbt package to make auditing dbt runs easy.☆97Updated last year
- ☆196Updated last year
- BigQuery Column Lineage parser☆56Updated 2 months ago
- A framework to manage data, continuously☆27Updated 2 months ago
- All the basics to get a nice containerized dbt development environment☆57Updated 2 years ago
- Test all the data☆37Updated last year
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆179Updated last year
- dbt support for database features which are not yet supported natively in dbt-core☆145Updated 4 months ago
- Weekly Data Engineering Newsletter☆93Updated 3 months ago
- This repository contains files for the metrics framework playbook.☆36Updated 2 years ago
- Cloned by the `dbt init` task☆59Updated 6 months ago