apache-spark-on-k8s / kubernetes-HDFS
Repository holding configuration files for running an HDFS cluster in Kubernetes
☆397Updated 3 months ago
Alternatives and similar repositories for kubernetes-HDFS:
Users who are interested in kubernetes-HDFS are comparing it to the repositories listed below.
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Running YARN on Kubernetes with PetSet controller.☆165Updated 6 years ago
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆657Updated 2 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated last year
- Kubernetes operator that provides control plane for managing Apache Flink applications☆569Updated 4 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆200Updated 2 years ago
- An integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆81Updated 4 years ago
- Curated Big Data Applications for Kubernetes☆99Updated last year
- Performance optimization for Spark running on Kubernetes☆85Updated 4 years ago
- Kubernetes custom controller and CRDs for managing Airflow☆299Updated 4 years ago
- Docker packaging for Apache Flink☆140Updated 4 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆176Updated 2 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆157Updated 3 years ago
- Exports Hadoop HDFS content statistics to Prometheus☆152Updated last week
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Updated 9 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆99Updated 2 years ago
- A Hadoop exporter for Prometheus that scrapes Hadoop metrics (including HDFS, YARN, MapReduce, HBase, etc.) from the JMX URLs of Hadoop components.☆87Updated 4 years ago
- Docker image with Ambari☆291Updated 7 years ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆201Updated this week
- Docker packaging for Apache Flink☆333Updated 2 months ago
- A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, big data, HPC☆1,083Updated last year
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 4 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…☆172Updated 2 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆277Updated last year
- Apache YuniKorn Core☆882Updated this week
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆131Updated last year
- Cloudera deployment automation with Ansible☆198Updated 4 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆898Updated 2 months ago