apache-spark-on-k8s / kubernetes-HDFS
Repository holding configuration files for running an HDFS cluster in Kubernetes
☆396Updated 6 months ago
Alternatives and similar repositories for kubernetes-HDFS:
Users that are interested in kubernetes-HDFS are comparing it to the libraries listed below
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Running YARN on Kubernetes with PetSet controller.☆166Updated 7 years ago
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆658Updated 2 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated last year
- Kubernetes operator that provides control plane for managing Apache Flink applications☆570Updated 7 months ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆83Updated 5 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆158Updated 3 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆199Updated 2 years ago
- Apache YuniKorn Core☆915Updated this week
- Performance optimization for Spark running on Kubernetes☆87Updated 4 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆180Updated 2 years ago
- Docker packaging for Apache Flink☆139Updated 5 years ago
- Exports Hadoop HDFS content statistics to Prometheus☆153Updated last week
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆281Updated 3 weeks ago
- Curated Big Data Applications for Kubernetes☆101Updated last year
- Docker packaging for Apache Flink☆343Updated last month
- Docker image with Ambari☆290Updated 7 years ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆634Updated last week
- Ansible playbooks for deploying Hortonworks Data Platform and DataFlow using Ambari Blueprints☆250Updated 4 years ago
- Examples for how to use the Flink Docker images in a variety of ways☆91Updated 3 years ago
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆909Updated this week
- A load balancer / proxy / gateway for prestodb☆357Updated 8 months ago
- CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and a…☆357Updated this week
- Mirror of Apache Bahir☆336Updated last year
- Schema Registry☆16Updated 9 months ago
- Setup for running Trino with Hive Metastore on Kubernetes☆101Updated 2 years ago
- TPC-DS Kit for Impala☆171Updated 10 months ago
- ☆382Updated last year
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆211Updated last week