stijndehaes / pyspark-k8s-example
☆25Updated 6 years ago
Alternatives and similar repositories for pyspark-k8s-example
Users that are interested in pyspark-k8s-example are comparing it to the libraries listed below
Sorting:
- Spark on Kubernetes infrastructure Helm charts repo☆201Updated 2 years ago
- Performance optimization for Spark running on Kubernetes☆88Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- The Internals of Delta Lake☆184Updated 4 months ago
- spark on kubernetes☆105Updated 2 years ago
- ☆127Updated 4 years ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Kinesis Connector for Structured Streaming☆136Updated 10 months ago
- Helm charts for Trino and Trino Gateway☆165Updated 2 weeks ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆65Updated 4 years ago
- A guide to running Airflow on Kubernetes☆173Updated 5 years ago
- Examples and custom spark images for working with the spark-on-k8s operator on AWS☆26Updated 4 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆750Updated last week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆100Updated 2 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆120Updated 3 years ago
- ☆80Updated 3 weeks ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS)☆53Updated 5 years ago
- ☆199Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.☆223Updated last month
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift.☆157Updated 3 years ago
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- REST API for Apache Spark on K8S or YARN☆98Updated this week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆335Updated 3 weeks ago
- ☆39Updated 4 years ago
- Pylint plugin for static code analysis on Airflow code☆94Updated 4 years ago
- A repository containing materials for Stateful Functions workshop☆44Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆345Updated 11 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated last week