testdrivenio / spark-kubernetes
Spark on Kubernetes
☆105 Updated 2 years ago
Alternatives and similar repositories for spark-kubernetes:
Users who are interested in spark-kubernetes compare it to the repositories listed below:
- Spark on Kubernetes using Helm ☆34 Updated 4 years ago
- Spark on Kubernetes infrastructure Helm charts repo ☆198 Updated 2 years ago
- The Internals of Spark on Kubernetes ☆70 Updated 2 years ago
- Deploy your Spark Production Cluster on Kubernetes ☆47 Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆97 Updated 2 years ago
- Setup for running Trino with Hive Metastore on Kubernetes ☆99 Updated 2 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆174 Updated last year
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines ☆128 Updated 2 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes ☆82 Updated 4 years ago
- CSD for Apache Airflow ☆20 Updated 5 years ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆173 Updated last week
- Spark on Kubernetes infrastructure Docker images repo ☆37 Updated 2 years ago
- Examples of Spark 3.0 ☆46 Updated 4 years ago
- ☆40 Updated 4 years ago
- Data validation library for PySpark 3.0.0 ☆34 Updated 2 years ago
- Base Docker image with just essentials: Hadoop, Hive and Spark. ☆68 Updated 4 years ago
- Performance optimization for Spark running on Kubernetes ☆86 Updated 4 years ago
- ☆79 Updated last year
- ☆25 Updated 5 months ago
- Apache Spark docker container image (Standalone mode) ☆35 Updated 4 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic ☆183 Updated 2 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆39 Updated 3 years ago
- Spark and Hive docker containers sharing a common MySQL metastore ☆26 Updated 4 years ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆214 Updated 2 months ago
- Interactive Notebooks that support the book ☆39 Updated 4 years ago
- A simple Spark-powered ETL framework that just works 🍺 ☆179 Updated 2 weeks ago
- REST API for Apache Spark on K8S or YARN ☆95 Updated this week
- Ambari stack service for installing and managing Apache Airflow on HDP cluster ☆59 Updated 6 years ago
- Tutorial on how to setup Trino and Apache Ranger using docker ☆41 Updated 6 months ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 5 years ago