TIBCOSoftware / snappy-on-k8sLinks
An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes
☆84Updated 5 years ago
Alternatives and similar repositories for snappy-on-k8s
Users that are interested in snappy-on-k8s are comparing it to the libraries listed below
Sorting:
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆157Updated 3 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Performance optimization for Spark running on Kubernetes☆89Updated 4 years ago
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆396Updated 10 months ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- Running YARN on Kubernetes with PetSet controller.☆166Updated 7 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆183Updated 3 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114Updated last year
- Kubernetes custom controller and CRDs to managing Airflow☆299Updated 5 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆69Updated 5 months ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆102Updated 2 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Schema Registry☆16Updated last year
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- ☆34Updated 4 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated 2 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- ☆37Updated 6 years ago
- Airflow on Kubernetes Operator☆87Updated 2 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Used to build the mesosphere/spark docker image and the DC/OS Spark package☆52Updated 4 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 3 years ago