Operator for Apache Spark-on-Kubernetes for Stackable Data Platform
☆69Updated this week
Alternatives and similar repositories for spark-k8s-operator
Users who are interested in spark-k8s-operator are comparing it to the repositories listed below
- Stackable Operator for Apache Airflow☆32Updated this week
- ☆27Feb 19, 2026Updated last week
- An Operator for Apache Druid for Stackable Data Platform☆12Feb 19, 2026Updated last week
- Kubernetes Operator for Apache HBase built by Stackable for the Stackable Data Platform☆19Feb 20, 2026Updated last week
- A Kubernetes operator for Apache NiFi☆46Updated this week
- Kubernetes operator for Apache Hadoop HDFS used by the Stackable Data Platform☆52Feb 20, 2026Updated last week
- A tool that can be used to deploy and manage Apache ZooKeeper clusters/ensembles☆35Updated this week
- ☆61Feb 20, 2026Updated last week
- Stackable Operator for Apache Kafka☆27Feb 20, 2026Updated last week
- A collection of crates to make implementing Kubernetes operators easier☆153Feb 20, 2026Updated last week
- Stackable's central documentation repository built on Antora☆13Feb 5, 2026Updated 3 weeks ago
- Trino load balancer with support for routing, queueing and auto-scaling☆37Feb 17, 2026Updated last week
- README for Rekcurd projects☆16Jul 31, 2019Updated 6 years ago
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆53May 26, 2022Updated 3 years ago
- Docker image for Spark history server on Kubernetes☆15Mar 13, 2020Updated 5 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆28Feb 9, 2026Updated 2 weeks ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆18Jul 31, 2023Updated 2 years ago
- Spark on Kubernetes samples☆20Jun 8, 2021Updated 4 years ago
- An Ansible collection for Cloudera Platform for cloud and Data Services☆21Jan 30, 2026Updated last month
- This charmed operator automates the operational procedures of running Prometheus, an open-source metrics backend.☆20Updated this week
- Tools to help search relevance engineers and business users tune search results for their OpenSearch applications.☆29Updated this week
- ☆12Nov 20, 2018Updated 7 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta Lake, Postgres + Debezium CDC, MySQL,…☆28May 19, 2025Updated 9 months ago
- Spark operator deployment and usage on OpenShift☆29Nov 25, 2024Updated last year
- Kubernetes controller for automated Node operations☆31Apr 21, 2025Updated 10 months ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆58Oct 30, 2018Updated 7 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347May 31, 2024Updated last year
- ☆31Jan 13, 2026Updated last month
- Apache Spark Kubernetes Operator☆263Updated this week
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- Java implementation of the EbMS 2.0 specification.☆10Feb 20, 2026Updated last week
- Denoising GANs -- TensorFlow2 training code for Gaussian denoiser using the GAN framework.☆10Jan 6, 2022Updated 4 years ago
- Repository for Plexe Sumo☆14Aug 4, 2018Updated 7 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Feb 18, 2026Updated last week
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆41Mar 12, 2023Updated 2 years ago
- Multitouch gestures on X11, Linux☆10Nov 22, 2015Updated 10 years ago
- Library for Excel-like calculations with some additional features like Calculation Graph and Custom Functions.☆10Jan 21, 2016Updated 10 years ago
- ☆10Dec 15, 2021Updated 4 years ago
- Charmed Operator for Zinc: a search engine that does full-text indexing. Zinc is a lightweight alternative to Elasticsearch.☆10Updated this week