gtoonstra / sqlineageLinks
A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking
☆20Updated 7 years ago
Alternatives and similar repositories for sqlineage
Users that are interested in sqlineage are comparing it to the libraries listed below
Sorting:
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- This repository is no longer maintained.☆15Updated 3 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Spark to Tableau Extractor library☆18Updated 7 years ago
- A cloud native data mesh implementation☆12Updated 4 years ago
- ☆39Updated 6 years ago
- spark-drools tutorials☆16Updated last year
- A collection of python utility functions☆11Updated last year
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last week
- A library that brings useful functions from various modern database management systems to Apache Spark☆59Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Spark SQL magic command for Jupyter notebooks☆36Updated 4 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Updated 2 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated last week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 6 months ago
- Transporter for integrating OpenLineage with OpenMetadata☆14Updated last week
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Updated 2 years ago
- A pyspark lib to validate data quality☆18Updated 2 years ago
- Asynchronous actions for PySpark☆47Updated 3 years ago