acryldata / datahub
A Generalized Metadata Search & Discovery Tool
☆23 · Updated this week
Alternatives and similar repositories for datahub:
Users who are interested in datahub are comparing it to the libraries listed below.
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆178 · Updated last week
- ☆79 · Updated last year
- Helm charts for Trino and Trino Gateway ☆161 · Updated last week
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time. ☆44 · Updated 2 weeks ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆92 · Updated last year
- ☆40 · Updated 4 years ago
- Tutorial on how to set up Trino and Apache Ranger using docker ☆41 · Updated 8 months ago
- ☆40 · Updated last year
- Setup for running Trino with Hive Metastore on Kubernetes ☆100 · Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆76 · Updated last month
- Spline agent for Apache Spark ☆191 · Updated last week
- Storage connector for Trino ☆106 · Updated last week
- Trino plugin for logging query events into a separate log file. ☆39 · Updated 2 years ago
- ☆189 · Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆238 · Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆97 · Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆94 · Updated 3 weeks ago
- Apache Spark Kubernetes Operator ☆106 · Updated this week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 · Updated last year
- ☆49 · Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆231 · Updated this week
- Mirror of Apache Ranger ☆15 · Updated 11 months ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆37 · Updated 3 weeks ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆345 · Updated 10 months ago
- Monitoring and insights on your data lakehouse tables ☆28 · Updated this week
- Performance optimization for Spark running on Kubernetes ☆87 · Updated 4 years ago
- Multi-hop declarative data pipelines ☆112 · Updated last week
- ☆56 · Updated this week
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications. ☆209 · Updated this week
- A load balancer / proxy / gateway for prestodb ☆357 · Updated 8 months ago