GoogleCloudDataproc / custom-imagesLinks
Tools for creating Dataproc custom images
☆34Updated last month
Alternatives and similar repositories for custom-images
Users that are interested in custom-images are comparing it to the libraries listed below
Sorting:
- Oozie Workflow to Airflow DAGs migration tool☆87Updated 3 months ago
- ☆54Updated 7 years ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli.☆13Updated last year
- ☆47Updated last year
- Cloud Spanner Connector for Apache Spark☆17Updated 5 months ago
- Sample code with integration between Data Catalog and Hive data source.☆24Updated 4 months ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆69Updated last year
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery☆64Updated 4 years ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab☆61Updated 5 years ago
- ☆21Updated 2 weeks ago
- An example PySpark project with pytest☆16Updated 7 years ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow☆13Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- A command-line tool for managing permissions and dependencies for BigQuery authorized views☆91Updated 3 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- Airflow on Kubernetes Operator☆88Updated 2 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆148Updated 8 years ago
- Apache Beam examples for running on Google Cloud Dataflow.☆30Updated 6 years ago
- Sample code with integration between Data Catalog and BI data sources.☆32Updated 3 years ago
- Apache Beam example☆26Updated 4 years ago
- Cloud Dataproc: Samples and Utils☆203Updated last week
- Astronomer Core Docker Images☆107Updated last year
- Dataproc templates and pipelines for solving in-cloud data tasks☆129Updated this week
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- ☆66Updated 10 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago