Natural-Intelligence / openLineage-openMetadata-transporterLinks
Transporter for integrating OpenLineage with OpenMetadata
☆15Updated 5 months ago
Alternatives and similar repositories for openLineage-openMetadata-transporter
Users that are interested in openLineage-openMetadata-transporter are comparing it to the libraries listed below
Sorting:
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆66Updated 2 weeks ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 7 months ago
- Data Catalog for Databases and Data Warehouses☆36Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆181Updated last month
- Open-source metadata collector based on ODD Specification☆44Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆80Updated this week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- Marquez Web UI☆21Updated 5 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆43Updated last month
- ☆70Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆93Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆115Updated 5 years ago
- Automation, Data Mash, Message Learning, AI Ops, Quantum Ops☆13Updated this week
- a dbt adapter for Apache Doris☆27Updated 2 years ago
- A platform to manage the data product life cycle☆21Updated this week
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆94Updated 3 years ago
- A curated list of Pulsar tools, integrations and resources.☆85Updated 5 years ago
- spark-drools tutorials☆16Updated last month
- Python driver for Timeplus Enterprise or Timeplus Proton☆17Updated last year
- Data pipelines from re-usable components☆107Updated 2 months ago
- Data lineage tools in python☆47Updated last year