ael-computas / gcp-cloud-composer-pod-operator
Contains example DAGs and Terraform code to create a Cloud Composer environment with a node pool for running pods
☆13 Updated 4 years ago
Alternatives and similar repositories for gcp-cloud-composer-pod-operator
Users who are interested in gcp-cloud-composer-pod-operator are comparing it to the libraries listed below.
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated 2 years ago
- Sample code with integration between Data Catalog and Hive data source. ☆24 Updated 4 months ago
- Repo with scripts and automation to help ensure best practices in Google Data Catalog ☆13 Updated 3 years ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- ☆47 Updated last year
- Composable filesystem hooks and operators for Apache Airflow. ☆17 Updated 3 years ago
- ☆24 Updated 5 years ago
- Examples for High Performance Spark ☆15 Updated 7 months ago
- Guide on how to set up Apache Airflow containers using Docker and IBM Bluemix ☆11 Updated 7 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code ☆95 Updated 4 years ago
- A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help lev… ☆22 Updated 2 years ago
- A template DBT project for BigQuery on Google Cloud ☆12 Updated 4 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level. ☆55 Updated last week
- a pytest plugin for dbt adapter test suites ☆19 Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆68 Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 4 months ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 Updated last year
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive ☆19 Updated 4 months ago
- Yet Another (Spark) ETL Framework ☆21 Updated last year
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 Updated 5 years ago
- ☆31 Updated 6 years ago
- dbt adwords models ☆18 Updated 4 months ago
- IPython magics to work with DBT ☆15 Updated 2 years ago
- Update a Google Data Catalog tag with dbt Cloud run metadata ☆22 Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 Updated last week
- Oozie Workflow to Airflow DAGs migration tool ☆87 Updated 2 months ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark ☆59 Updated last year