MarquezProject / marquez
Collect, aggregate, and visualize a data ecosystem's metadata
☆1,732Updated last week
Related projects: ⓘ
- An Open Standard for lineage metadata collection☆1,708Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,391Updated last week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,866Updated last week
- ☆1,606Updated this week
- Dremio - the missing link in modern data☆1,356Updated 2 weeks ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆984Updated this week
- SQL Lineage Analysis Tool powered by Python☆1,282Updated last week
- Egeria core☆796Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,540Updated 4 months ago
- Apache Atlas☆1,813Updated 2 weeks ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,252Updated last week
- Dynamically generate Apache Airflow DAGs from YAML configuration files☆1,158Updated last week
- An open protocol for secure data sharing☆747Updated 3 weeks ago
- Efficient data transformation and modeling framework that is backwards compatible with dbt.☆1,612Updated this week
- Hop Orchestration Platform☆937Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms.☆677Updated last month
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,126Updated this week
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,193Updated last week
- The interoperable, open source catalog for Apache Iceberg☆1,012Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆780Updated 2 weeks ago
- Generate and Visualize Data Lineage from query history☆308Updated last year
- Utility functions for dbt projects.☆1,334Updated last week
- Data Lineage Tracking And Visualization Solution☆596Updated last week
- Port(ish) of Great Expectations to dbt test macros☆1,044Updated last week
- Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code☆592Updated this week
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,104Updated last year
- Guides and docs to help you get up and running with Apache Airflow.☆798Updated last year
- Apache Iceberg☆6,161Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆1,887Updated this week
- Mirror of Apache griffin☆1,123Updated last week