HPEEzmeral / spark-on-k8s
☆14Updated this week
Related projects: ⓘ
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆81Updated 4 years ago
- Dione - a Spark and HDFS indexing library☆49Updated 5 months ago
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Performance optimization for Spark running on Kubernetes☆84Updated 4 years ago
- Schema Registry integration for Apache Spark☆39Updated last year
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆82Updated 5 months ago
- Setup for running Trino with Hive Metastore on Kubernetes☆98Updated 2 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆199Updated last year
- Extensible streaming ingestion pipeline on top of Apache Spark☆43Updated 5 months ago
- Instant access to the Spark cluster from anywhere☆16Updated 3 years ago
- Presto & Alluxio Dockers for blazing fast analytics☆13Updated 4 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆14Updated 6 months ago
- Spark Structured Streaming State Tools☆34Updated 4 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆156Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆86Updated 6 months ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated last year
- ☆40Updated last year
- Rocksdb state storage implementation for Structured Streaming.☆16Updated 3 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆101Updated 4 years ago
- A Spark datasource for the HadoopOffice library☆39Updated last year
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆36Updated 6 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Spark on Kubernetes infrastructure Docker images repo☆37Updated last year
- ☆17Updated 5 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated last month
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 2 years ago
- ☆104Updated last year