Acxiom / metalus
This project aims to make writing Spark applications easier by abstracting the effort to assemble the driver into reusable steps and pipelines.
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for metalus
- Project to create configurable ETL via lightbend configuration using Spark Structured Streaming☆8Updated 6 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Updated 4 years ago
- Running Presto on k8s☆38Updated 5 years ago
- Kafka Connect FileSystem Connector☆111Updated 2 years ago
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated 3 months ago
- Apache Ranger Plugin for S3☆19Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated 3 months ago
- ☆26Updated 9 months ago
- Trino plugin for logging query events into a separate log file.☆39Updated last year
- ☆18Updated 5 months ago
- ☆23Updated 5 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆98Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Developing Spark External Data Sources using the V2 API☆46Updated 6 years ago
- CSD for Apache Airflow☆20Updated 5 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆158Updated 2 years ago
- Apache Hive Metastore as a Standalone server in Docker☆65Updated 2 months ago
- Apache Spark ETL Utilities☆40Updated 3 weeks ago
- Spline agent for Apache Spark☆186Updated last week
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆184Updated last year
- Mirror of Apache Ranger☆15Updated 7 months ago
- ☆21Updated last week
- ☆252Updated 3 weeks ago
- Building custom data sources for Apache Spark, in Java.☆12Updated 4 years ago
- ☆24Updated 3 years ago
- Kafka Sink Connect OrientDB https://www.confluent.io/hub/sanjuthomas/kafka-connect-orientdb☆10Updated 4 months ago
- Capture the logical plan from Spark (SQL)☆21Updated 3 years ago
- Set of ETL utils for Spark☆15Updated 4 years ago
- Sample project for Apache Flink with Streaming Engine and JDBC Sink☆21Updated 7 years ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆139Updated 10 months ago