Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
☆35Feb 13, 2018Updated 8 years ago
Alternatives and similar repositories for spark-on-k8s-gcp-examples
Users that are interested in spark-on-k8s-gcp-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆54Aug 3, 2017Updated 8 years ago
- This sample app will get up and running quickly with Hive and/or Pig on a Hadoop cluster on Google Compute Engine. For more information …☆19Jan 9, 2018Updated 8 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- Hive Storage Handler for interoperability between BigQuery and Apache Hive☆19Jan 29, 2025Updated last year
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Generate BigQuery tables, load and extract data, based on JSON Table Schema descriptors.☆18Jun 1, 2021Updated 5 years ago
- Question and Answer application using AWS Bedrock, AWS ECS, Langchain, Qdrant, and FastAPI☆15Feb 27, 2024Updated 2 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Highly configurable Helm Presto Chart☆24Nov 13, 2019Updated 6 years ago
- Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.☆292Updated this week
- Compose minio + kafka for bucket notifications☆14Jun 16, 2021Updated 5 years ago
- ☆15Mar 3, 2023Updated 3 years ago
- Campaign Manager 360 and Display & Video 360 Reports to BigQuery connector☆37Apr 18, 2023Updated 3 years ago
- Use Kubernetes to autoscale your spark clusters.☆10May 2, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- ☆31Mar 7, 2025Updated last year
- Docker packaging for Apache Flink☆139Feb 4, 2020Updated 6 years ago
- Load data in BigQuery using Cloud Workflows, Firestore and Cloud Functions.☆11May 12, 2021Updated 5 years ago
- A book about Maven in the style of the Pragmatic Guides published by The Pragmatic Bookshelf☆11Dec 12, 2015Updated 10 years ago
- Content Data Store (HDFS/HBase)☆13Dec 1, 2016Updated 9 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- Test for SparkSQL ScalaPB☆14Jun 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Semana Data Vault☆21May 30, 2025Updated last year
- a library which can be used to create story driven clustered load-testing packages through a very readable and understandable api.☆30May 20, 2010Updated 16 years ago
- HDFS Automatic Snapshot Service for Linux☆11Oct 17, 2016Updated 9 years ago
- GlusterFS plugin for Hadoop HCFS☆69Apr 12, 2022Updated 4 years ago
- Example Apache Flink cluster on Kubernetes☆21Aug 22, 2018Updated 7 years ago
- Examples for how to use the Flink Docker images in a variety of ways☆91Oct 12, 2021Updated 4 years ago
- Global package for Cloud Management in Python☆15Apr 13, 2026Updated 2 months ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery☆45Aug 31, 2023Updated 2 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Jul 31, 2018Updated 7 years ago
- ☆14May 27, 2022Updated 4 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Enable Falco to read audit logs from EKS☆11Dec 13, 2020Updated 5 years ago
- Docker Image for Kudu☆38Feb 20, 2019Updated 7 years ago
- An open source library for BigQuery testing.☆14Jun 22, 2022Updated 3 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago