rootsongjc / spark-on-kubernetes
Image building contents for running Spark standalone on Kubernetes
☆16Updated 5 years ago
Alternatives and similar repositories for spark-on-kubernetes
Users that are interested in spark-on-kubernetes are comparing it to the libraries listed below
- Infrastructure automation to deploy Hadoop, Hive, Spark, and Airflow nodes on a Docker host☆20Updated 6 years ago
- Flink image for Kubernetes that fixes a JobManager connection issue☆26Updated 6 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- ☆79Updated last year
- SQL CLI for Apache Flink® via docker-compose☆50Updated last year
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 7 years ago
- ☆40Updated 2 years ago
- spark on kubernetes☆104Updated 2 years ago
- pulsar lakehouse connector☆33Updated 3 months ago
- ☆25Updated this week
- Presto Trino with Apache Hive Postgres metastore☆42Updated 10 months ago
- ☆48Updated last year
- Instructions for getting started with Ververica Platform on minikube.☆92Updated last week
- ☆39Updated 6 years ago
- This repository contains recipes for Apache Pinot.☆30Updated 4 months ago
- Pipeline library for StreamSets Data Collector and Transformer☆33Updated 2 years ago
- PostgreSQL configured to work as a metastore for Hive.☆32Updated 2 years ago
- spark-drools tutorials☆16Updated last year
- Jupyter Integration for Flink SQL via Ververica Platform☆43Updated last year
- This project demonstrates real-time streaming of CDC data from MySQL to Apache Iceberg using Flink SQL Client for faster data analytics a…☆23Updated last year
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi☆15Updated 2 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆70Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Updated 7 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆21Updated 3 years ago
- Apache Flink docker image☆195Updated 3 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆115Updated last year
- CDAP UI☆20Updated last week