japila-books / delta-lake-internals
The Internals of Delta Lake
โ183Updated 2 weeks ago
Alternatives and similar repositories for delta-lake-internals:
Users that are interested in delta-lake-internals are comparing it to the libraries listed below
- A library that provides useful extensions to Apache Spark and PySpark.โ207Updated last month
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productiveโ186Updated last year
- A simple Spark-powered ETL framework that just works ๐บโ178Updated last year
- Flowchart for debugging Spark applicationsโ104Updated 4 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ118Updated last week
- The Internals of Spark SQLโ459Updated 2 weeks ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are inโฆโ86Updated 9 months ago
- Spline agent for Apache Sparkโ190Updated 3 weeks ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaโฆโ722Updated last week
- Avro SerDe for Apache Spark structured APIs.โ231Updated 6 months ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.โ75Updated 9 months ago
- Snowflake Data Source for Apache Spark.โ222Updated last month
- Examples of Spark 3.0โ47Updated 4 years ago
- Qubole Sparklens tool for performance tuning Apache Sparkโ569Updated 7 months ago
- Custom state store providers for Apache Sparkโ92Updated 2 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizerโ25Updated 3 weeks ago
- The Internals of Spark on Kubernetesโ70Updated 2 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)โ438Updated last month
- A simplified, lightweight ETL Framework based on Apache Sparkโ585Updated last year
- ACID Data Source for Apache Spark based on Hive ACIDโ97Updated 3 years ago
- Spark style guideโ257Updated 3 months ago
- The Internals of Spark Structured Streamingโ416Updated 2 years ago
- โ306Updated 6 years ago
- โ63Updated 5 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelinesโ114Updated this week
- Build configuration-driven ETL pipelines on Apache Sparkโ159Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | ้กน็ฎๅทฒ่ฟ็งป่ณ Apaโฆโ172Updated 2 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.โ345Updated 7 months ago
- Sample processing code using Spark 2.1+ and Scalaโ51Updated 4 years ago
- Spark Structured Streaming State Toolsโ34Updated 4 years ago