gtoonstra / sqlineage
A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking
☆20 Updated 7 years ago
Alternatives and similar repositories for sqlineage:
Users who are interested in sqlineage are comparing it to the libraries listed below:
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources ☆17 Updated 3 years ago
- A cloud native data mesh implementation ☆12 Updated 4 years ago
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 4 years ago
- Dockerized setup for testing code on realistic hadoop clusters ☆27 Updated 4 years ago
- Spark to Tableau Extractor library ☆18 Updated 7 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- A plugin for Apache Airflow that allows you to manage the users who can log in ☆14 Updated 5 years ago
- ☆39 Updated 6 years ago
- A SQL parser ☆56 Updated 2 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 Updated 3 years ago
- A Spark datasource for the HadoopOffice library ☆38 Updated 2 years ago
- A plugin for Airflow that creates and manages your DAG with a web UI. ☆20 Updated 7 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 Updated 2 years ago
- Apache Drill Dialect for SQL Alchemy ☆54 Updated 2 weeks ago
- Java library for authoring PMML ☆15 Updated last week
- Open-source metadata collector based on ODD Specification ☆43 Updated last year
- Pylint plugin for static code analysis on Airflow code ☆93 Updated 4 years ago
- PMML scoring library for Scala ☆63 Updated 3 weeks ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 Updated 2 months ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores. ☆20 Updated 3 years ago
- DDL parse and Convert to BigQuery JSON schema and DDL statements ☆88 Updated last year
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆50 Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 Updated 3 weeks ago
- REST-like API exposing Airflow data and operations ☆61 Updated 6 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆62 Updated 3 months ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆73 Updated 5 years ago
- Provides a Pythonic interface for reading and writing Avro schemas ☆27 Updated 2 years ago
- Quark is a data virtualization engine over analytic databases. ☆98 Updated 7 years ago