linkedin / datahub-gmaLinks
General Metadata Architecture
☆126Updated last week
Alternatives and similar repositories for datahub-gma
Users that are interested in datahub-gma are comparing it to the libraries listed below
Sorting:
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 11 months ago
- Spline agent for Apache Spark☆194Updated this week
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.☆47Updated last month
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 6 months ago
- Apache DataLab (incubating)☆153Updated last year
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆190Updated 2 weeks ago
- Spark Connector to read and write with Pulsar☆113Updated this week
- A data generator source connector for Flink SQL based on data-faker.☆224Updated last year
- Schema Registry☆16Updated last year
- A simple Spark-powered ETL framework that just works 🍺☆181Updated last month
- Apache Calcite Avatica☆260Updated 3 months ago
- Instructions for getting started with Ververica Platform on minikube.☆92Updated 5 months ago
- A tool to install, configure and manage Trino installations☆27Updated 3 years ago
- A playground to experience Gravitino☆50Updated last month
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆840Updated last month
- Ranger Hive Metastore Plugin☆18Updated last year
- Apache Iceberg Documentation Site☆42Updated last year
- FeatHub - A stream-batch unified feature store for real-time machine learning☆334Updated last year
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆142Updated last year
- The Internals of Spark on Kubernetes☆71Updated 3 years ago
- Egeria core☆852Updated this week
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆216Updated last week
- DataQuality for BigData☆144Updated last year
- Data Lineage Tracking And Visualization Solution☆632Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- A load balancer / proxy / gateway for prestodb☆358Updated 11 months ago
- ☆47Updated last year
- Apache Flink connectors for Pravega.☆94Updated last year