apache / spark-docker
Official Dockerfile for Apache Spark
☆106 · Updated last week
Related projects
Alternatives and complementary repositories for spark-docker
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆111 · Updated 2 months ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆195 · Updated this week
- Spline agent for Apache Spark ☆186 · Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆214 · Updated this week
- The Internals of Spark on Kubernetes ☆70 · Updated 2 years ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆163 · Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆96 · Updated last year
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks ☆400 · Updated this week
- REST API for Apache Spark on K8S or YARN ☆91 · Updated this week
- Snowflake Data Source for Apache Spark. ☆217 · Updated this week
- Performance Observability for Apache Spark ☆194 · Updated this week
- Spark on Kubernetes infrastructure Helm charts repo ☆199 · Updated 2 years ago
- A Python Library to support running data quality rules while the spark job is running ⚡ ☆162 · Updated this week
- The Internals of Delta Lake ☆182 · Updated last month
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆83 · Updated 7 months ago
- A simple Spark-powered ETL framework that just works 🍺 ☆178 · Updated 11 months ago
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage) ☆191 · Updated this week
- Delta Lake examples ☆205 · Updated last month
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆35 · Updated last month
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆60 · Updated this week
- Apache Spark Kubernetes Operator ☆63 · Updated last month
- Apache Iceberg Documentation Site ☆42 · Updated 9 months ago
- Avro SerDe for Apache Spark structured APIs. ☆231 · Updated 3 months ago