stijndehaes / pyspark-k8s-exampleLinks
☆25Updated 6 years ago
Alternatives and similar repositories for pyspark-k8s-example
Users that are interested in pyspark-k8s-example are comparing it to the libraries listed below
Sorting:
- Spark on Kubernetes infrastructure Helm charts repo☆202Updated 3 years ago
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes☆487Updated last week
- A Helm chart to install Apache Airflow on Kubernetes☆290Updated 3 weeks ago
- A guide to running Airflow on Kubernetes☆173Updated 6 years ago
- ☆128Updated 5 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆798Updated 3 weeks ago
- A simple spark standalone cluster for your testing environment purposses☆570Updated last year
- ☆40Updated 4 years ago
- REST API for Apache Spark on K8S or YARN☆108Updated this week
- Airflow Backfill UI based plugin for existing / new Airflow environment☆64Updated 4 years ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆705Updated last year
- A simplified, lightweight ETL Framework based on Apache Spark☆586Updated last year
- Examples and custom spark images for working with the spark-on-k8s operator on AWS☆26Updated 4 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆52Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 6 months ago
- A boilerplate for writing PySpark Jobs☆394Updated last year
- spark on kubernetes☆104Updated 2 years ago
- A plugin for Apache Airflow that allows you to edit DAGs in browser☆454Updated 2 weeks ago
- fast and scalable Airflow on Kubernetes Setup.☆28Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆500Updated 3 weeks ago
- PySpark test helper methods with beautiful error messages☆730Updated 2 months ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆134Updated 3 years ago
- Airflow Unit Tests and Integration Tests☆261Updated 3 years ago
- This project contains examples which demonstrate how to deploy analytic models to mission-critical, scalable production environments leve…☆873Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Updated 2 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆130Updated 3 weeks ago
- Boilerplate for PySpark on Cloud Kubernetes☆33Updated 4 years ago
- A convenient Python wrapper for Apache NiFi☆270Updated 2 weeks ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆288Updated last year