TIBCOSoftware / snappy-on-k8sLinks
An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes
☆84Updated 5 years ago
Alternatives and similar repositories for snappy-on-k8s
Users that are interested in snappy-on-k8s are comparing it to the libraries listed below
Sorting:
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆157Updated 3 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Running YARN on Kubernetes with PetSet controller.☆166Updated 7 years ago
- Performance optimization for Spark running on Kubernetes☆89Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆183Updated 3 years ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year
- Setup for running Trino with Hive Metastore on Kubernetes☆102Updated 3 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- ☆34Updated 4 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Used to build the mesosphere/spark docker image and the DC/OS Spark package☆52Updated 4 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 6 months ago
- StreamLine - Streaming Analytics☆165Updated last year
- Schema Registry☆17Updated last year
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆397Updated 11 months ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- AMQP data source for dstream (Spark Streaming)☆26Updated 3 years ago
- Multiple node presto cluster on docker container☆124Updated 3 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114Updated last year
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- spark on kubernetes☆104Updated 2 years ago
- Jupyter Integration for Flink SQL via Ververica Platform☆43Updated 2 years ago