rootsongjc / spark-on-kubernetesLinks
Image building contents for running Spark standalone on Kubernetes
☆16Updated 5 years ago
Alternatives and similar repositories for spark-on-kubernetes
Users that are interested in spark-on-kubernetes are comparing it to the libraries listed below
Sorting:
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Updated 6 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆20Updated 3 years ago
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 6 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- spark-drools tutorials☆16Updated last year
- ☆39Updated 6 years ago
- ☆14Updated 2 years ago
- Helm chart from stable/hadoop, updated to hadoop 3.2.1☆22Updated 5 years ago
- ☆79Updated last year
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 7 years ago
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- Helm chart: single-node, pseudo-distributed, kerberized, hadoop cluster: K8S☆19Updated 7 years ago
- Verify Hive SQL without running the sql exactly. Just check the syntax before run.☆24Updated 12 years ago
- ☆11Updated 9 years ago
- This repository contains recipes for Apache Pinot.☆30Updated 3 months ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Updated 5 months ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Kubernetes manifest files for building Hadoop clusters☆9Updated 6 years ago
- Ranger Hive Metastore Plugin☆18Updated last year
- ☆40Updated 2 years ago
- Docker Image for Kudu☆38Updated 6 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Updated 2 years ago
- The Internals of PySpark☆26Updated 5 months ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- some useful User Defined Functions(UDF) for both PrestoSQL and TrinoDB☆18Updated 2 years ago
- Documentation of Hologres☆13Updated 4 years ago
- Get started with Apache Beam and Flink☆43Updated 8 years ago
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆23Updated last year
- SQL CLI for Apache Flink® via docker-compose☆48Updated last year