empathyco / platform-spark-kubernetes-samples
Spark on Kubernetes samples
☆20Updated 3 years ago
Alternatives and similar repositories for platform-spark-kubernetes-samples
Users who are interested in platform-spark-kubernetes-samples are comparing it to the repositories listed below
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆45Updated 2 years ago
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆63Updated this week
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆66Updated 3 years ago
- ☆25Updated last year
- ☆57Updated 10 months ago
- Terraform Provider for Airbyte API☆55Updated 3 weeks ago
- Examples and custom spark images for working with the spark-on-k8s operator on AWS☆26Updated 4 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Terraform module to create AWS EMR resources 🇺🇦☆26Updated this week
- Docker image for Spark history server on Kubernetes☆15Updated 5 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆101Updated 2 years ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆37Updated 3 months ago
- Airflow on Kubernetes Operator☆88Updated 2 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆29Updated last year
- Helm charts for Trino and Trino Gateway☆166Updated last week
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- ☆33Updated this week
- Apache Flink (Pyflink) and Related Projects☆39Updated last month
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Presto Trino with Apache Hive Postgres metastore☆41Updated 8 months ago
- Performance optimization for Spark running on Kubernetes☆89Updated 4 years ago
- ☆56Updated this week
- spark on kubernetes☆104Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- ☆18Updated 11 months ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆21Updated last week