apache-spark-on-k8s / spark
Apache Spark enhanced with a native Kubernetes scheduler back-end. NOTE: this repository is being ARCHIVED, as all new development for the Kubernetes scheduler back-end is now on https://github.com/apache/spark/
☆612 Updated 5 years ago
Alternatives and similar repositories for spark:
Users who are interested in spark compare it to the repositories listed below.
- Repository holding configuration files for running an HDFS cluster in Kubernetes ☆397 Updated 3 months ago
- Running YARN on Kubernetes with PetSet controller. ☆165 Updated 6 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes ☆176 Updated last year
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications. ☆657 Updated 2 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆241 Updated 9 years ago
- Kubernetes custom controller and CRDs for managing Airflow ☆299 Updated 4 years ago
- Docker build for Apache Spark ☆673 Updated 3 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. ☆2,842 Updated last week
- A Kafka Operator for Kubernetes ☆294 Updated 6 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall ☆98 Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,008 Updated 2 years ago
- Mirror of Apache Bahir ☆337 Updated last year
- Apache Kafka on Apache Mesos ☆412 Updated 6 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆176 Updated 2 years ago
- Benchmark Suite for Apache Spark ☆238 Updated last year
- Docker image with Ambari ☆291 Updated 7 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications ☆569 Updated 4 months ago
- Mirror of Apache Toree (Incubating) ☆741 Updated 2 months ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC ☆1,083 Updated last year
- DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spa… ☆156 Updated 3 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆898 Updated 2 months ago
- A tool for monitoring and tuning Spark jobs for efficiency. ☆357 Updated 2 years ago
- DoctorK is a service for Kafka cluster auto healing and workload balancing ☆631 Updated 3 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit… ☆283 Updated 6 years ago
- Used to build the mesosphere/spark docker image and the DC/OS Spark package ☆52 Updated 4 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆553 Updated 3 years ago