Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
☆600Mar 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for initialization-actions
Users that are interested in initialization-actions are comparing it to the libraries listed below
Sorting:
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆289Mar 10, 2026Updated last week
- Cloud Dataproc: Samples and Utils☆206Mar 11, 2026Updated last week
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆422Mar 6, 2026Updated last week
- Labs and demos for courses for GCP Training (http://cloud.google.com/training).☆8,470Mar 12, 2026Updated last week
- Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially…☆3,010Updated this week
- Cloud Dataflow Google-provided templates for solving in-Cloud data tasks☆1,288Updated this week
- ☆27May 1, 2024Updated last year
- ☆14May 27, 2022Updated 3 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆155Dec 14, 2019Updated 6 years ago
- ☆85Jan 26, 2026Updated last month
- Cloud ML Engine repo. Please visit the new Vertex AI samples repo at https://github.com/GoogleCloudPlatform/vertex-ai-samples☆1,539Dec 17, 2021Updated 4 years ago
- Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.☆234Feb 20, 2026Updated last month
- Interactive tools and developer experiences for Big Data on Google Cloud Platform.☆969Sep 2, 2022Updated 3 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re…☆167Jul 25, 2018Updated 7 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Mar 31, 2022Updated 3 years ago
- Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017☆1,410Feb 20, 2026Updated last month
- ☆54Aug 3, 2017Updated 8 years ago
- ☆31Oct 17, 2018Updated 7 years ago
- Processing Logs at Scale using Cloud Dataflow☆62Mar 18, 2019Updated 7 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.☆164May 31, 2017Updated 8 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,620Feb 27, 2026Updated 3 weeks ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆152Jan 15, 2017Updated 9 years ago
- Database plugins☆13Mar 6, 2026Updated 2 weeks ago
- ☆277Jun 1, 2016Updated 9 years ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,515Updated this week
- This repository includes end-to-end labs on how to use GCP for applied data science☆14Aug 28, 2018Updated 7 years ago
- Google Cloud Client Libraries for Python☆5,236Updated this week
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆108Sep 19, 2024Updated last year
- Code samples used on cloud.google.com☆8,015Updated this week
- Google Cloud Pubsub connector for Spark Streaming☆17Oct 21, 2021Updated 4 years ago
- Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.☆1,288Feb 17, 2026Updated last month
- Uses Cloud Build to deploy a scalable batch ingestion pipeline consisting of GCS, Cloud Functions, Dataflow and BigQuery☆22Dec 7, 2022Updated 3 years ago
- Example Kubernetes app that shows how to build a 'pipeline' to stream data into BigQuery. Uses Redis or Google Cloud PubSub☆131Oct 20, 2020Updated 5 years ago
- Machine Learning on Google Cloud Platform☆514Updated this week
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Feb 13, 2018Updated 8 years ago
- A collection of Google Cloud Platform (GCP) plugins☆49Mar 3, 2026Updated 2 weeks ago
- Terraform Examples☆16Aug 13, 2015Updated 10 years ago
- Uses Google Prediction API to label GitHub Issues as they are created.☆27Dec 5, 2018Updated 7 years ago
- Tools for creating Dataproc custom images☆35Feb 23, 2026Updated 3 weeks ago