mozilla / docker-etlLinks
Collection of dockerized ETL jobs managed by data engineering.
☆19Updated this week
Alternatives and similar repositories for docker-etl
Users that are interested in docker-etl are comparing it to the libraries listed below
Sorting:
- LookML Generator for Glean and Mozilla Data☆21Updated this week
- End-to-end DataOps platform deployed by Terraform.☆67Updated 3 months ago
- ☆46Updated last year
- A Python API for Asynchronously Loading Data into Snowflake DB -☆66Updated 3 weeks ago
- Airflow configuration for Telemetry☆191Updated last week
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆56Updated this week
- Data Catalog Tag Templates☆30Updated 2 months ago
- Apache Airflow CI pipeline☆19Updated 6 years ago
- Sample code with integration between Data Catalog and RDBMS data sources.☆71Updated 3 years ago
- Bigquery ETL☆320Updated this week
- Extension dtypes for pandas corresponding to GoogleSQL data types such as DATE, TIME, and JSON.☆31Updated this week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-orchestration-airflow☆15Updated 2 years ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev…☆22Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- ☆22Updated 4 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-datacatalog☆52Updated 2 years ago
- Documentation and implementation of telemetry ingestion on Google Cloud Platform☆83Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week
- ETL jobs for Firefox Telemetry☆28Updated 2 months ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆160Updated 4 months ago
- Weekly Data Engineering Newsletter☆95Updated 11 months ago
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery☆64Updated 4 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 3 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆93Updated 11 months ago
- Fivetran data models for QuickBooks using dbt.☆34Updated 2 weeks ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs☆48Updated this week
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆113Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-iam☆37Updated last year
- Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Cat…☆56Updated last week