SETL-Framework / setlLinks
A simple Spark-powered ETL framework that just works πΊ
β181Updated last month
Alternatives and similar repositories for setl
Users that are interested in setl are comparing it to the libraries listed below
Sorting:
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productiveβ185Updated 2 years ago
- The Internals of Delta Lakeβ184Updated 5 months ago
- A library that provides useful extensions to Apache Spark and PySpark.β226Updated 3 months ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.β75Updated last year
- Spline agent for Apache Sparkβ194Updated this week
- Smart Automation Tool for building modern Data Lakes and Data Pipelinesβ124Updated this week
- A simplified, lightweight ETL Framework based on Apache Sparkβ586Updated last year
- Avro SerDe for Apache Spark structured APIs.β236Updated 2 weeks ago
- DataQuality for BigDataβ144Updated last year
- Snowflake Data Source for Apache Spark.β226Updated last week
- Flowchart for debugging Spark applicationsβ105Updated 8 months ago
- Build configuration-driven ETL pipelines on Apache Sparkβ159Updated 2 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are inβ¦β89Updated last month
- Visualize column-level data lineage in Spark SQLβ92Updated 3 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizerβ25Updated 5 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data piβ¦β95Updated this week
- ACID Data Source for Apache Spark based on Hive ACIDβ97Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Sparkβ59Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aβ¦β125Updated last month
- Spark style guideβ259Updated 8 months ago
- Qubole Sparklens tool for performance tuning Apache Sparkβ579Updated 11 months ago
- β63Updated 5 years ago
- Spark package for checking data qualityβ221Updated 5 years ago
- The Internals of Spark SQLβ468Updated this week
- Custom state store providers for Apache Sparkβ92Updated 4 months ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.β344Updated last year
- Data Lineage Tracking And Visualization Solutionβ632Updated this week
- The Internals of Spark on Kubernetesβ71Updated 3 years ago
- Code snippets used in demos recorded for the blog.β37Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0β99Updated 2 years ago