linkedin / datahub-gmaLinks
General Metadata Architecture
☆125Updated this week
Alternatives and similar repositories for datahub-gma
Users that are interested in datahub-gma are comparing it to the libraries listed below
Sorting:
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 10 months ago
- Spline agent for Apache Spark☆193Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 6 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆840Updated 2 weeks ago
- A data generator source connector for Flink SQL based on data-faker.☆221Updated last year
- Apache DataLab (incubating)☆153Updated last year
- A load balancer / proxy / gateway for prestodb☆358Updated 10 months ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.☆46Updated last month
- A tool to install, configure and manage Trino installations☆27Updated 3 years ago
- Visualize column-level data lineage in Spark SQL☆91Updated 3 years ago
- Instructions for getting started with Ververica Platform on minikube.☆91Updated 4 months ago
- Apache Flink Website☆152Updated this week
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆222Updated 2 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Apache Calcite Avatica☆260Updated 2 months ago
- Trino Connector for Apache Paimon.☆34Updated last month
- Remote Shuffle Service for Flink☆190Updated 2 years ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆189Updated this week
- Apache Iceberg Documentation Site☆42Updated last year
- Docker image for Apache Hive Metastore☆71Updated 2 years ago
- Data Lineage Tracking And Visualization Solution☆627Updated this week
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆283Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆224Updated 2 months ago
- Data ingestion library for Amundsen to build graph and search index☆205Updated last year
- A playground to experience Gravitino☆48Updated 3 weeks ago
- The gateway component to make Spark on K8s much easier for Spark users.☆192Updated last month
- Apache Flink connector for ElasticSearch☆82Updated this week
- A simple Spark-powered ETL framework that just works 🍺☆181Updated last month
- Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.☆182Updated 2 weeks ago