apache-spark-on-k8s / spark
Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apache/spark/
☆613 Updated 5 years ago
Alternatives and similar repositories for spark
Users who are interested in spark are comparing it to the repositories listed below.
- Repository holding configuration files for running an HDFS cluster in Kubernetes ☆398 Updated last year
- Running YARN on Kubernetes with PetSet controller. ☆166 Updated 7 years ago
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications. ☆658 Updated 3 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes ☆176 Updated 2 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆240 Updated 10 years ago
- Docker image with Ambari ☆291 Updated 7 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 Updated 3 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications ☆579 Updated 2 months ago
- Kubernetes custom controller and CRDs for managing Airflow ☆299 Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆183 Updated 3 years ago
- Mirror of Apache Bahir ☆335 Updated 2 years ago
- Used to build the mesosphere/spark docker image and the DC/OS Spark package ☆52 Updated 4 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall ☆98 Updated 5 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes ☆84 Updated 5 years ago
- CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and a… ☆360 Updated this week
- Operator for managing the Spark clusters on Kubernetes and OpenShift. ☆158 Updated 3 years ago
- ☆763 Updated 4 years ago
- Docker packaging for Apache Flink ☆139 Updated 5 years ago
- Performance optimization for Spark running on Kubernetes ☆89 Updated 5 years ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC ☆1,094 Updated 2 years ago
- Mirror of Apache Myriad (Incubating) ☆153 Updated 2 years ago
- Docker build for Apache Spark ☆672 Updated 3 years ago
- Kerberos and Hadoop: The Madness beyond the Gate ☆280 Updated 2 years ago
- Ansible playbooks for deploying Hortonworks Data Platform and DataFlow using Ambari Blueprints ☆248 Updated 4 years ago
- Benchmark Suite for Apache Spark ☆241 Updated 2 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. ☆3,049 Updated last week
- A Kafka Operator for Kubernetes ☆294 Updated 6 years ago
- [EOL] Image build contents for Kubernetes applications. ☆47 Updated 7 years ago
- Apache YuniKorn Core ☆979 Updated this week
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,366 Updated 2 years ago