thesquelched / spark-lineageView external linksLinks
Spark SQL listener to record lineage information
☆28Jan 24, 2021Updated 5 years ago
Alternatives and similar repositories for spark-lineage
Users that are interested in spark-lineage are comparing it to the libraries listed below
Sorting:
- spark 字段血缘 spark field lineage☆32Jun 7, 2022Updated 3 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Nov 16, 2022Updated 3 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- TSG Client is a Python library for interacting with the TNO Security Gateway (TSG) Core Container☆18Mar 28, 2025Updated 10 months ago
- simd enabled column imprints☆11Feb 12, 2018Updated 8 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- REST job server for Apache Spark☆44May 23, 2025Updated 8 months ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- Repository for the OAC (ODRL profile for Access Control) documentation: https://w3id.org/oac☆10Oct 20, 2024Updated last year
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated last year
- ☆16Updated this week
- Ray Framework (https://github.com/ray-project/ray) on Kubernetes☆12Oct 12, 2018Updated 7 years ago
- Data Catalog Project☆11Dec 23, 2024Updated last year
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆15Apr 3, 2025Updated 10 months ago
- Repository of the metadata specification mobilityDCAT-AP☆18Jan 22, 2026Updated 3 weeks ago
- Policy Administration point to handle ODRL policies and provide their Rego-equivalent to the Open Policy Agent☆11Feb 5, 2026Updated last week
- ☆10Apr 13, 2020Updated 5 years ago
- A reference implementation for the did:webs DID method specified here https://github.com/trustoverip/tswg-did-method-webs-specification. …☆13Oct 28, 2024Updated last year
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- Cloudera CDP SDK for Java☆15Feb 10, 2026Updated last week
- sql code autocomplete☆44Sep 2, 2020Updated 5 years ago
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 2 years ago
- Atlassian Bamboo and Bitbucket images for GKE clusters☆10Mar 24, 2022Updated 3 years ago
- spark connector for Milvus☆14Jan 19, 2026Updated 3 weeks ago
- giter8 template for Spark Jobserver☆12Jan 19, 2018Updated 8 years ago
- nestjs blog system☆12Apr 10, 2021Updated 4 years ago
- 🌀 Pontus-X Portal Web App☆12Feb 5, 2026Updated last week
- A Swift micro-framework for generating compact identifiers that are time ordered in distributed systems without the need for synchronizat…☆13Mar 10, 2018Updated 7 years ago
- Scala Mison implementation☆15Nov 16, 2018Updated 7 years ago
- run-dsp is an open source Go implementation of the IDSA dataspaces protocol.☆13Updated this week
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Presto Gateway routes query based on policy.☆12Sep 15, 2020Updated 5 years ago
- A collection of data space building blocks based on the Eclipse Dataspace Components☆13Feb 9, 2026Updated last week
- Tail a log file and send log lines automatically to a kafka topic☆19Jun 8, 2015Updated 10 years ago
- Data Lineage Tracking And Visualization Solution☆655Updated this week