mozilla / gcp-ingestion
Documentation and implementation of telemetry ingestion on Google Cloud Platform
☆83 Updated this week
Alternatives and similar repositories for gcp-ingestion
Users who are interested in gcp-ingestion are comparing it to the repositories listed below
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP ☆92 Updated 9 months ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs ☆48 Updated this week
- ☆128 Updated last year
- ☆47 Updated last year
- Airflow configuration for Telemetry ☆186 Updated this week
- Bigquery ETL ☆298 Updated this week
- ☆66 Updated 9 months ago
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat… ☆54 Updated this week
- Sample code with integration between Data Catalog and RDBMS data sources. ☆72 Updated 3 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc ☆48 Updated last year
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev… ☆22 Updated 2 years ago
- Dataproc templates and pipelines for solving in-cloud data tasks ☆128 Updated last month
- ☆29 Updated last year
- Sample code with integration between Data Catalog and Hive data source. ☆25 Updated 3 months ago
- ☆69 Updated this week
- Collection of dockerized ETL jobs managed by data engineering. ☆20 Updated last week
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow ☆43 Updated 2 weeks ago
- A guide for Mozilla's developers and data scientists to analyze and interpret the data gathered by our data collection systems. ☆89 Updated last week
- ☆48 Updated this week
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level. ☆55 Updated 3 weeks ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow ☆13 Updated 3 years ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam ☆196 Updated last week
- Data Catalog Tag Templates ☆30 Updated this week
- Database plugins ☆14 Updated last week
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆159 Updated 2 months ago
- ☆76 Updated this week
- LookML Generator for Glean and Mozilla Data ☆20 Updated this week
- End-to-end DataOps platform deployed by Terraform. ☆66 Updated last month
- Open source tools for Google Cloud Storage and Databases. ☆62 Updated last year
- ☆17 Updated 7 months ago