Spark SQL listener to record lineage information
☆28Jan 24, 2021Updated 5 years ago
Alternatives and similar repositories for spark-lineage
Users that are interested in spark-lineage are comparing it to the libraries listed below
Sorting:
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Jun 18, 2020Updated 5 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Nov 16, 2022Updated 3 years ago
- FederatedCatalog☆12Updated this week
- TSG Client is a Python library for interacting with the TNO Security Gateway (TSG) Core Container☆18Mar 28, 2025Updated 11 months ago
- simd enabled column imprints☆11Feb 12, 2018Updated 8 years ago
- My Blog☆76May 3, 2018Updated 7 years ago
- Facilitates collaboration and governance for all participants in a Data Space.☆13Feb 27, 2026Updated last week
- Hadoop/Hive/Spark container to perform CI tests☆10Dec 26, 2020Updated 5 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- REST job server for Apache Spark☆44May 23, 2025Updated 9 months ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago
- sql code autocomplete☆44Sep 2, 2020Updated 5 years ago
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated 2 years ago
- Infra stuff to run Kubernetes on travisci☆10Mar 7, 2023Updated 3 years ago
- Cloudera CDP SDK for Java☆16Feb 27, 2026Updated last week
- ☆16Updated this week
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated 11 months ago
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- Policy Administration point to handle ODRL policies and provide their Rego-equivalent to the Open Policy Agent☆11Feb 23, 2026Updated 2 weeks ago
- Repository of the metadata specification mobilityDCAT-AP☆18Feb 18, 2026Updated 2 weeks ago
- Repository for the OAC (ODRL profile for Access Control) documentation: https://w3id.org/oac☆10Oct 20, 2024Updated last year
- Atlassian Bamboo and Bitbucket images for GKE clusters☆10Mar 24, 2022Updated 3 years ago
- ☆10Apr 13, 2020Updated 5 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- Data Catalog Project☆11Dec 23, 2024Updated last year
- A reference implementation for the did:webs DID method specified here https://github.com/trustoverip/tswg-did-method-webs-specification. …☆13Oct 28, 2024Updated last year
- 🌀 Pontus-X Portal Web App☆12Updated this week
- Presto Gateway routes query based on policy.☆12Sep 15, 2020Updated 5 years ago
- Scala Mison implementation☆15Nov 16, 2018Updated 7 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- A collection of data space building blocks based on the Eclipse Dataspace Components☆13Feb 9, 2026Updated last month
- ☆17Mar 2, 2026Updated last week
- run-dsp is an open source Go implementation of the IDSA dataspaces protocol.☆13Mar 2, 2026Updated last week
- Data Lineage Tracking And Visualization Solution☆656Updated this week
- ☆15Oct 12, 2021Updated 4 years ago
- 录制Spak视频课程讲解涉及编写的源代码 https://edu.hellobi.com/course/107/overview☆13Apr 23, 2019Updated 6 years ago
- ☆11Jul 18, 2021Updated 4 years ago
- 提供了solr到elasticsearch的语法翻译引擎,兼容现有的solr语法,提供了基于注解的ORM实现☆12Oct 8, 2015Updated 10 years ago