GoogleCloudPlatform / datacatalog-tag-engine
Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Catalog and Dataplex. Tag Engine is licensed under the Apache 2 license terms. Please make sure to read, understand and agree to the terms of the LICENSE and CONTRIBUTING files before proceeding.
☆49 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for datacatalog-tag-engine
- Utility to identify and rewrite common anti-patterns in BigQuery SQL syntax ☆83 Updated this week
- Deploys a secured BigQuery data warehouse ☆77 Updated 2 months ago
- Sample code with integration between Data Catalog and BI data sources. ☆32 Updated 2 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level. ☆46 Updated 2 weeks ago
- Data Catalog Tag Templates ☆29 Updated 3 weeks ago
- Commons code used by the Data Catalog connectors, and links for the connectors sample code. ☆61 Updated 2 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆63 Updated 6 months ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con… ☆21 Updated 2 months ago
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow. ☆142 Updated 5 months ago
- Data Quality Engine for BigQuery ☆258 Updated 3 months ago
- Sample code with integration between Data Catalog and Hive data source. ☆25 Updated 6 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks ☆118 Updated this week
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP ☆89 Updated 2 months ago
- An end-to-end demo of Google's Cloud data and analytic stack. ☆224 Updated this week
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow ☆41 Updated this week
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆150 Updated this week
- Creates opinionated BigQuery datasets and tables ☆201 Updated this week
- Sample code with integration between Data Catalog and RDBMS data sources. ☆72 Updated 2 years ago
- The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow. ☆22 Updated this week
- A Python package to centralize some Google Cloud Data Catalog scripts. This repo contains commands like bulk CSV operations that help lev… ☆21 Updated last year
- Deploys a Lakehouse Architecture Solution ☆33 Updated last week
- Opinionated setup for securely using AI Platform Notebooks. ☆42 Updated 3 months ago
- Queries to assist with BigQuery cost and performance. ☆70 Updated last week