gtoonstra / sqlineage
A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking
☆20Updated 7 years ago
Alternatives and similar repositories for sqlineage:
Users that are interested in sqlineage are comparing it to the libraries listed below
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Alluxio Python client - Access Any Data Source with Python☆26Updated last week
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last year
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated last year
- Data Sketches for Apache Spark☆22Updated 2 years ago
- Spark to Tableau Extractor library☆18Updated 7 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- Java library for authoring PMML☆16Updated last month
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆34Updated 4 months ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated 5 months ago
- This repository is no longer maintained.☆15Updated 3 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆47Updated 2 years ago
- Java event logs collector for hadoop and frameworks☆39Updated last month
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- Amundsen Gremlin☆21Updated 2 years ago
- event-triggered plugins for airflow☆21Updated 5 years ago
- Spark Application UI extension for JupyterLab☆10Updated 3 years ago
- ☆10Updated 2 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- ☆39Updated 6 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- IPython magics to work with DBT☆15Updated 2 years ago
- Hadoop Yarn aggregated log parser utility☆23Updated 5 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago