GoogleCloudPlatform / dataproc-templates
Dataproc templates and pipelines for solving simple in-cloud data tasks
☆116Updated this week
Related projects: ⓘ
- ☆55Updated last week
- An end to end demo of Google's Cloud data and analytic stack.☆212Updated last week
- Data Quality Engine for BigQuery☆255Updated 2 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆63Updated 4 months ago
- Utility to identify and rewrite common anti patterns in BigQuery SQL syntax☆81Updated this week
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆142Updated 3 months ago
- ☆11Updated 9 months ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆45Updated last week
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat…☆48Updated last week
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆150Updated last week
- ☆122Updated 4 months ago
- ☆126Updated 4 months ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆369Updated last week
- A repository of sample code to accompany our blog post on Airflow and dbt.☆167Updated last year
- ☆46Updated 4 months ago
- Repository for Beam College sessions☆101Updated 3 years ago
- ☆30Updated 5 months ago
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery☆64Updated 4 years ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 2 years ago
- Data Foundation - Google Cloud Cortex Framework☆158Updated 3 weeks ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 2 years ago
- Creates opinionated BigQuery datasets and tables☆190Updated this week
- Data pipeline with dbt, Airflow, Great Expectations☆155Updated 3 years ago
- A dbt adapter for Databricks.☆211Updated this week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated 10 months ago
- ☆25Updated this week
- Commons code used by the Data Catalog connectors, and links for the connectors sample code.☆61Updated 2 years ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆135Updated this week
- Utility to compare data between homogeneous or heterogeneous environments to ensure source and target tables match☆397Updated this week
- Deploys a secured BigQuery data warehouse☆77Updated last month