empathyco / platform-spark-kubernetes-samples
Spark on Kubernetes samples
☆20Updated 3 years ago
Alternatives and similar repositories for platform-spark-kubernetes-samples:
Users who are interested in platform-spark-kubernetes-samples are comparing it to the repositories listed below
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆37Updated last month
- Examples and custom spark images for working with the spark-on-k8s operator on AWS☆27Updated 4 years ago
- Performance optimization for Spark running on Kubernetes☆87Updated 4 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆19Updated 7 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆66Updated 3 years ago
- Helm charts for Trino and Trino Gateway☆162Updated last week
- Apache Flink (Pyflink) and Related Projects☆37Updated last week
- Presto Trino with Apache Hive Postgres metastore☆41Updated 7 months ago
- Setup for running Trino with Hive Metastore on Kubernetes☆101Updated 2 years ago
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆61Updated this week
- Docker environment to stream data from Kafka to Iceberg tables☆27Updated last year
- Sample Airflow DAGs☆62Updated 2 years ago
- Terraform Provider for Airbyte API☆54Updated last week
- Docker image for Spark history server on Kubernetes☆15Updated 5 years ago
- Library that aims to generate Kubernetes YAML templates from an Airflow DAG using the Airflow Kubernetes Pod Operator☆10Updated 3 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- Trino (f.k.a. PrestoSQL) dialect for SQLAlchemy.☆25Updated 2 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆18Updated 8 months ago
- Aiven's S3 Sink Connector for Apache Kafka®☆69Updated 7 months ago
- Apache Spark Kubernetes Operator☆113Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring☆26Updated 8 months ago