GoogleCloudPlatform / datacatalog-tag-engine
Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Catalog and Dataplex. Tag Engine is licensed under the Apache 2 license terms. Please make sure to read, understand and agree to the terms of the LICENSE and CONTRIBUTING files before proceeding.
☆53Updated this week
Alternatives and similar repositories for datacatalog-tag-engine:
Users that are interested in datacatalog-tag-engine are comparing it to the libraries listed below
- Deploys a secured BigQuery data warehouse☆82Updated last week
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆53Updated this week
- ☆60Updated 2 months ago
- Commons code used by the Data Catalog connectors, and links for the connectors sample code.☆61Updated 3 years ago
- Utility to identify and rewrite common anti patterns in BigQuery SQL syntax☆98Updated 4 months ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con…☆23Updated this week
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- ☆28Updated 10 months ago
- Data Catalog Tag Templates☆30Updated 5 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆125Updated 2 weeks ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 3 years ago
- ☆47Updated 10 months ago
- Data Quality Engine for BigQuery☆266Updated 8 months ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆158Updated last month
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆142Updated 9 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆66Updated 10 months ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆90Updated 7 months ago
- ☆13Updated 5 months ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated last month
- Use Remote Functions to tokenize data with DLP in BigQuery using SQL☆21Updated 4 months ago
- DMT is an end to end automation of data warehouse migration, focused on extraction, SQL translation, data migration, data validation, etc…☆32Updated this week
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev…☆22Updated 2 years ago
- ☆34Updated 7 months ago
- ☆128Updated 10 months ago
- Deploys a Lakehouse Architecture Solution☆37Updated last week
- Creates opinionated BigQuery datasets and tables☆208Updated this week
- Queries to assist with BigQuery cost and performance.☆82Updated 4 months ago
- An end to end demo of Google's Cloud data and analytic stack.☆240Updated this week
- DIY commercial datasets on Google Cloud Platform☆88Updated 2 weeks ago
- ☆134Updated 4 months ago