GoogleCloudDataproc / cloud-dataproc
Cloud Dataproc: Samples and Utils
☆198Updated last month
Related projects: ⓘ
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆369Updated last week
- ☆126Updated 4 months ago
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆279Updated last week
- Run in all nodes of your cluster before the cluster starts - lets you customize your cluster☆587Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated 11 months ago
- ☆24Updated this week
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆63Updated 4 months ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆146Updated 7 years ago
- Data Quality Engine for BigQuery☆255Updated 2 months ago
- Opinion Analysis of News, Threaded Conversations, and User Generated Content☆100Updated 4 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆116Updated this week
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆142Updated 3 months ago
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 4 years ago
- Repository of sample Databricks notebooks☆242Updated 5 months ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re…☆167Updated 6 years ago
- ☆195Updated 11 months ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab☆60Updated 4 years ago
- ☆84Updated 6 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 4 years ago
- ☆64Updated last month
- ☆11Updated 9 months ago
- ☆46Updated 4 months ago
- ☆55Updated last week
- markup to create labs for courses from the Google Cloud training catalog.☆49Updated 2 years ago
- BigQuery ML SQL templates for common marketing use cases☆169Updated 5 years ago
- Data Warehousing Made Easy with Google BigQuery and Apache Airflow☆19Updated 5 years ago
- A curated list of awesome resources for Apache Beam☆146Updated last year
- ☆81Updated 10 months ago
- An example Apache Beam project.☆111Updated 7 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆36Updated 6 years ago