dunnhumby / democratizing-dataproc
Using Terraform, deploy multiple Dataproc clusters that share a Hive metastore
☆15 Updated 2 years ago
Alternatives and similar repositories for democratizing-dataproc:
Users who are interested in democratizing-dataproc are comparing it to the repositories listed below
- ☆47 Updated 10 months ago
- ☆54 Updated 7 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow ☆30 Updated 8 years ago
- Uses Cloud Build to deploy a scalable batch ingestion pipeline consisting of GCS, Cloud Functions, Dataflow and BigQuery ☆22 Updated 2 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP ☆91 Updated 7 months ago
- A Singer (https://singer.io) target that writes data to Google BigQuery. ☆39 Updated 4 years ago
- An open source library for BigQuery testing. ☆14 Updated 2 years ago
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery ☆64 Updated 4 years ago
- Styles for dbt on the net ☆9 Updated 3 months ago
- Sample code with integration between Data Catalog and BI data sources. ☆32 Updated 3 years ago
- ☆13 Updated 5 months ago
- Helm chart for deploying Apache Airflow in kubernetes ☆19 Updated 5 years ago
- ☆24 Updated 4 years ago
- Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs… ☆88 Updated 11 years ago
- ☕⛵ WIP PySpark dependency management ☆22 Updated 6 years ago
- Utilities to help HBase as a service in HDInsight Azure ☆14 Updated last year
- ☆33 Updated 11 months ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated last month
- Data models for snowplow analytics. ☆127 Updated last month
- Create backups of BigQuery datasets/tables ☆40 Updated last year
- Composable filesystem hooks and operators for Apache Airflow. ☆17 Updated 3 years ago
- BigQuery Schema Conversion Tool ☆23 Updated 4 years ago
- Data Catalog Tag Templates ☆30 Updated 5 months ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab ☆60 Updated 5 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆147 Updated 8 years ago
- A Helm Chart for Apache Airflow ☆14 Updated 6 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 2 months ago
- Documentation and implementation of telemetry ingestion on Google Cloud Platform ☆83 Updated this week
- ☆63 Updated this week