TomLous / medium-spark-k8s
Spark on Kubernetes using Helm
☆34Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for medium-spark-k8s
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Flowchart for debugging Spark applications☆101Updated last month
- The official repository for the Rock the JVM Spark Optimization 2 course☆37Updated 11 months ago
- Magic to help Spark pipelines upgrade☆33Updated last month
- The official repository for the Rock the JVM Spark Optimization with Scala course☆55Updated 11 months ago
- This project provides a reverse proxy for Spark UI on Kubernetes☆14Updated last year
- spark on kubernetes☆105Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated 2 months ago
- Examples of Spark 3.0☆47Updated 3 years ago
- The Internals of Delta Lake☆182Updated last month
- ☆43Updated 3 months ago
- Code snippets used in demos recorded for the blog.☆29Updated 3 weeks ago
- ☆63Updated 5 years ago
- Materials (slides and code) for Kafka and Kafka Streams Workshops☆61Updated 4 months ago
- Performance optimization for Spark running on Kubernetes☆85Updated 4 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆199Updated 2 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆98Updated 2 years ago
- Presto Trino with Apache Hive Postgres metastore☆37Updated 2 months ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- ☆78Updated last year
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆25Updated last year
- CSD for Apache Airflow☆20Updated 5 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆110Updated this week
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 2 years ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆156Updated last month
- Spark on Kubernetes infrastructure Docker images repo☆37Updated 2 years ago