SETL-Framework / setlLinks
A simple Spark-powered ETL framework that just works πΊ
β182Updated last month
Alternatives and similar repositories for setl
Users that are interested in setl are comparing it to the libraries listed below
Sorting:
- Smart Automation Tool for building modern Data Lakes and Data Pipelinesβ122Updated last week
- The Internals of Delta Lakeβ186Updated 10 months ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productiveβ184Updated 3 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark.β230Updated last week
- A simplified, lightweight ETL Framework based on Apache Sparkβ587Updated last year
- Flowchart for debugging Spark applicationsβ107Updated last year
- DataQuality for BigDataβ144Updated last year
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.β76Updated last year
- Snowflake Data Source for Apache Spark.β230Updated 3 weeks ago
- β63Updated 6 years ago
- A tool to validate data, built around Apache Spark.β100Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.β346Updated last year
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data piβ¦β96Updated last month
- Spline agent for Apache Sparkβ199Updated last week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aβ¦β130Updated last week
- Code snippets used in demos recorded for the blog.β37Updated 2 weeks ago
- ACID Data Source for Apache Spark based on Hive ACIDβ97Updated 4 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are inβ¦β94Updated 6 months ago
- Qubole Sparklens tool for performance tuning Apache Sparkβ585Updated last year
- Build configuration-driven ETL pipelines on Apache Sparkβ161Updated 3 years ago
- Data Lineage Tracking And Visualization Solutionβ647Updated this week
- Avro SerDe for Apache Spark structured APIs.β236Updated 5 months ago
- Custom state store providers for Apache Sparkβ92Updated 9 months ago
- Spark style guideβ264Updated last year
- Spark package for checking data qualityβ222Updated 5 years ago
- A library that brings useful functions from various modern database management systems to Apache Sparkβ60Updated 2 years ago
- Examples of Spark 3.0β45Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizerβ25Updated 10 months ago
- The Internals of Spark SQLβ477Updated this week
- Magic to help Spark pipelines upgradeβ34Updated last year