mozilla / gcp-ingestion
Documentation and implementation of telemetry ingestion on Google Cloud Platform
☆83Updated last week
Alternatives and similar repositories for gcp-ingestion:
Users that are interested in gcp-ingestion are comparing it to the libraries listed below
- Bigquery ETL☆294Updated this week
- Airflow configuration for Telemetry☆186Updated this week
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat…☆54Updated last month
- Schemas for Mozilla's data ingestion pipeline and data lake outputs☆48Updated this week
- ☆47Updated 11 months ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆92Updated 8 months ago
- ☆128Updated 11 months ago
- Dataproc templates and pipelines for solving in-cloud data tasks☆127Updated 3 weeks ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆72Updated 3 years ago
- Commons code used by the Data Catalog connectors, and links for the connectors sample code.☆61Updated 3 years ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆160Updated 2 months ago
- Sample code with integration between Data Catalog and Hive data source.☆25Updated 2 months ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow☆43Updated 3 weeks ago
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆143Updated 10 months ago
- Oozie Workflow to Airflow DAGs migration tool☆87Updated last month
- ☆60Updated 3 weeks ago
- ☆28Updated 11 months ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev…☆22Updated 2 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- LookML Generator for Glean and Mozilla Data☆20Updated this week
- ☆66Updated 8 months ago
- Creates opinionated BigQuery datasets and tables☆215Updated this week
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- Data Quality Engine for BigQuery☆270Updated 9 months ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆195Updated this week
- Advertising Data Lakes and Workflow Automation☆50Updated 4 years ago
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆102Updated 7 months ago
- Cask Hydrator Plugins Repository☆68Updated last week
- ☆31Updated 6 years ago