dunnhumby / democratizing-dataprocLinks
Using terraform, deploy multiple dataproc clusters using a shared hive metastore
☆15Updated 2 years ago
Alternatives and similar repositories for democratizing-dataproc
Users that are interested in democratizing-dataproc are comparing it to the libraries listed below
Sorting:
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- ☆54Updated 7 years ago
- Tools for creating Dataproc custom images☆34Updated last month
- ☆33Updated last year
- ☆47Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs…☆87Updated 11 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆92Updated 10 months ago
- A Helm Chart for Apache Airflow☆14Updated 6 years ago
- Replicates data between Google Cloud BigQuery projects☆21Updated 8 years ago
- Create backups of BigQuery datasets/tables☆40Updated last year
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆103Updated 9 months ago
- ☆13Updated 2 weeks ago
- Ephemeral Hadoop clusters using Google Compute Platform☆136Updated 3 years ago
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst…☆36Updated 5 years ago
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery☆64Updated 4 years ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆44Updated last month
- CloudEvent Types for Python☆27Updated this week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-datacatalog☆52Updated 2 years ago
- Relational Database Import to Big Query with Dataflow and DLP API☆18Updated 5 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- A serverless bot which periodically checks configured BigQuery capacity commitments, reservations and assignments against actual slot con…☆24Updated 2 months ago
- An open source library for BigQuery testing.☆14Updated 3 years ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- Example Kubernetes app that shows how to build a 'pipeline' to stream data into BigQuery. Uses Redis or Google Cloud PubSub☆129Updated 4 years ago
- Oozie Workflow to Airflow DAGs migration tool☆87Updated 3 months ago
- ☆14Updated 3 years ago
- Database plugins☆14Updated 2 weeks ago
- Shoprunner Terraform provider - Open Source initiative☆37Updated 5 years ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab☆61Updated 5 years ago