YotpoLtd / metorikkuLinks
A simplified, lightweight ETL Framework based on Apache Spark
โ589Updated last year
Alternatives and similar repositories for metorikku
Users that are interested in metorikku are comparing it to the libraries listed below
Sorting:
- Data Lineage Tracking And Visualization Solutionโ638Updated last week
- A simple Spark-powered ETL framework that just works ๐บโ182Updated this week
- Qubole Sparklens tool for performance tuning Apache Sparkโ580Updated last year
- Spline agent for Apache Sparkโ196Updated last week
- The Internals of Spark SQLโ471Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaโฆโ775Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.โ344Updated last year
- The Internals of Delta Lakeโ184Updated 6 months ago
- Avro SerDe for Apache Spark structured APIs.โ235Updated last month
- Essential Spark extensions and helper methods โจ๐ฒโ759Updated 3 weeks ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ126Updated last week
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productiveโ186Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark.โ228Updated last week
- Build configuration-driven ETL pipelines on Apache Sparkโ160Updated 2 years ago
- DataQuality for BigDataโ144Updated last year
- The Internals of Spark Structured Streamingโ418Updated 2 years ago
- A Spark plugin for reading and writing Excel filesโ509Updated last week
- Snowflake Data Source for Apache Spark.โ226Updated last month
- Smart Automation Tool for building modern Data Lakes and Data Pipelinesโ124Updated this week
- A Spark Atlas connector to track data lineage in Apache Atlasโ267Updated 2 years ago
- Spark style guideโ260Updated 10 months ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)โ447Updated last month
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.โ916Updated last month
- A load balancer / proxy / gateway for prestodbโ358Updated last year
- โ311Updated 6 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.โ283Updated last week
- The Workload Analyzer collects Prestoยฎ and Trino workload statistics, and analyzes themโ135Updated last year
- Spark on Kubernetes infrastructure Helm charts repoโ204Updated 2 years ago
- Spark package for checking data qualityโ221Updated 5 years ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.โ142Updated last year