GoogleCloudPlatform / datacatalog-tag-engine
Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Catalog and Dataplex. Tag Engine is licensed under the Apache 2 license terms. Please make sure to read, understand and agree to the terms of the LICENSE and CONTRIBUTING files before proceeding.
☆51Updated last month
Alternatives and similar repositories for datacatalog-tag-engine:
Users that are interested in datacatalog-tag-engine are comparing it to the libraries listed below
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆49Updated last month
- Utility to identify and rewrite common anti patterns in BigQuery SQL syntax☆97Updated 3 months ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- ☆59Updated last month
- Deploys a secured BigQuery data warehouse☆80Updated last month
- Commons code used by the Data Catalog connectors, and links for the connectors sample code.☆61Updated 3 years ago
- ☆46Updated 9 months ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con…☆23Updated 2 months ago
- ☆28Updated 9 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆64Updated 9 months ago
- Data Catalog Tag Templates☆30Updated 4 months ago
- ☆127Updated 9 months ago
- ☆33Updated 6 months ago
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆142Updated 8 months ago
- Data Quality Engine for BigQuery☆264Updated 7 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆123Updated last week
- An end to end demo of Google's Cloud data and analytic stack.☆236Updated last week
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆90Updated 6 months ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 3 years ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow☆42Updated 2 weeks ago
- Queries to assist with BigQuery cost and performance.☆77Updated 3 months ago
- DMT is an end to end automation of data warehouse migration, focused on extraction, SQL translation, data migration, data validation, etc…☆30Updated last week
- ☆13Updated 4 months ago
- Use Remote Functions to tokenize data with DLP in BigQuery using SQL☆21Updated 2 months ago
- ☆132Updated 3 months ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆156Updated 3 weeks ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev…☆21Updated 2 years ago
- Sample code with integration between Data Catalog and Hive data source.