gtoonstra / sqlineage
A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking
☆20Updated 7 years ago
Alternatives and similar repositories for sqlineage:
Users that are interested in sqlineage are comparing it to the libraries listed below
- Asynchronous actions for PySpark☆47Updated 3 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Alluxio Python client - Access Any Data Source with Python☆26Updated 2 weeks ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Data Sketches for Apache Spark☆22Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- ☆39Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆49Updated last year
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Updated last year
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- DDL parase and Convert to BigQuery JSON schema and DDL statements☆87Updated last year
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- A cloud native data mesh implementation☆12Updated 4 years ago
- Functional testing framework for Big Data pipelines.☆57Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated last month
- Apache Airflow CI pipeline☆18Updated 5 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year
- Spark SQL magic command for Jupyter notebooks☆35Updated 3 years ago
- Java event logs collector for hadoop and frameworks☆39Updated 5 months ago
- Java library for authoring PMML☆15Updated last week
- A tool and library for easily deploying applications on Apache YARN☆142Updated 10 months ago