Wh1isper / sparglim
Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!
☆35Updated last month
Related projects: ⓘ
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated last month
- Simple project to expose a catalog over REST using a Java catalog backend☆103Updated this week
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Apache Spark Kubernetes Operator☆48Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- ☆40Updated last year
- REST API for Apache Spark on K8S or YARN☆89Updated last week
- ☆143Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆51Updated this week
- ☆77Updated last year
- ☆232Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆80Updated 6 months ago
- ☆197Updated last month
- ☆49Updated this week
- Apache Hive Metastore as a Standalone server in Docker☆64Updated 3 weeks ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆40Updated last month
- This project provides a reverse proxy for Spark UI on Kubernetes☆14Updated 11 months ago
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- ☆23Updated last week
- A Table format agnostic data sharing framework☆36Updated 7 months ago
- Spline agent for Apache Spark☆183Updated last week
- Storage connector for Trino☆90Updated 2 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆40Updated 7 months ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆82Updated 5 months ago
- Performance optimization for Spark running on Kubernetes☆84Updated 4 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated 10 months ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆160Updated this week
- ☆129Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆91Updated this week
- Presto Trino with Apache Hive Postgres metastore☆36Updated last week