magda-io / magda
A federated, open-source data catalog for all your big data and small data
★ 513 · Updated this week
Related projects
Alternatives and complementary repositories for magda
- Awesome Data Catalogs and Observability Platforms. ★ 720 · Updated 3 months ago
- Dremio - the missing link in modern data ★ 1,378 · Updated 2 weeks ago
- Tool to automate data quality checks on data pipelines ★ 249 · Updated 2 years ago
- An Open Standard for lineage metadata collection ★ 1,763 · Updated this week
- Egeria core ★ 808 · Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata ★ 1,774 · Updated this week
- The metrics layer for your data. Join us at https://metriql.com/slack ★ 298 · Updated last year
- Generate and Visualize Data Lineage from query history ★ 311 · Updated last year
- ODD Specification is a universal open standard for collecting metadata. ★ 129 · Updated 2 weeks ago
- Data Tools Subjective List ★ 80 · Updated last year
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ★ 1,033 · Updated this week
- Dataform is a framework for managing SQL based data operations in BigQuery ★ 848 · Updated this week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer… ★ 507 · Updated 2 months ago
- Making DAG construction easier ★ 242 · Updated this week
- The premier open source Data Quality solution ★ 596 · Updated 3 weeks ago
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business… ★ 1,212 · Updated last month
- Open Control Plane for Tables in Data Lakehouse ★ 308 · Updated this week
- One framework to develop, deploy and operate data workflows with Python and SQL. ★ 428 · Updated this week
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ★ 788 · Updated 2 years ago
- dbt (data build tool) adapter for the Dremio ★ 43 · Updated 3 months ago
- Data Pipeline Framework using the singer.io spec ★ 640 · Updated last week
- Writes the Singer format from Python ★ 544 · Updated last month
- Use SQL to build ELT pipelines on a data lakehouse. ★ 285 · Updated 2 years ago
- re_data - fix data issues before your users & CEO would discover them ★ 1,551 · Updated 6 months ago
- This repository is a getting started guide to Singer. ★ 1,268 · Updated 2 months ago
- Great Expectations Airflow operator ★ 159 · Updated 2 weeks ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ★ 347 · Updated this week
- An open protocol for secure data sharing ★ 769 · Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ★ 1,906 · Updated last week
- Dremio Container Tools ★ 155 · Updated 2 weeks ago