open-metadata / openmetadata-sqllineage
SQL Lineage Analysis Tool powered by Python
☆13Updated last year
Related projects: ⓘ
- Apache DolphinScheduler Python API, aka PyDolphinscheduler.☆50Updated 3 weeks ago
- ☆46Updated last year
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆106Updated 9 months ago
- Visualize column-level data lineage in Spark SQL☆85Updated 2 years ago
- Open-source metadata collector based on ODD Specification☆42Updated 10 months ago
- a proof of concept project to implement sqllineage with antlr4.☆35Updated 11 months ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆16Updated last month
- Java demos for the General SQL Parser library☆122Updated this week
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆18Updated 8 months ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆46Updated last year
- DataQuality for BigData☆139Updated 9 months ago
- ☆82Updated last month
- ☆66Updated last year
- A library based on delta for Spark and MLSQL☆61Updated 3 years ago
- Spline agent for Apache Spark☆183Updated last week
- Instructions for getting started with Ververica Platform on minikube.☆89Updated 4 months ago
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- Stock analysis MLOps system based on DolphinScheduler☆12Updated last year
- Cluster manager for Apache Doris☆168Updated 10 months ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 7 months ago
- datacollector-oss☆89Updated last month
- sql code autocomplete☆39Updated 4 years ago
- Trino Connector for Apache Paimon.☆25Updated 3 weeks ago
- Spark SQL listener to record lineage information☆28Updated 3 years ago
- Spark ClickHouse Connector build on DataSourceV2 API☆181Updated this week
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆139Updated 8 months ago
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!☆24Updated 3 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆80Updated 6 months ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.☆42Updated last week