magda-io / magda
A federated, open-source data catalog for all your big data and small data
☆542 · Updated this week
Alternatives and similar repositories for magda
Users who are interested in magda are comparing it to the repositories listed below.
- Egeria core ☆847 · Updated this week
- Tool to automate data quality checks on data pipelines ☆255 · Updated 2 years ago
- 📙 Awesome Data Catalogs and Observability Platforms. ☆855 · Updated last month
- Generate and Visualize Data Lineage from query history ☆326 · Updated last year
- ODD Specification is a universal open standard for collecting metadata. ☆138 · Updated 7 months ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies ☆104 · Updated 2 years ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆307 · Updated 2 years ago
- An Open Standard for lineage metadata collection ☆1,953 · Updated this week
- The premier open source Data Quality solution ☆632 · Updated last week
- Dremio Container Tools ☆161 · Updated last month
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies… ☆1,113 · Updated 2 years ago
- Data Pipeline Framework using the singer.io spec ☆648 · Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆1,927 · Updated last week
- Dremio - the missing link in modern data ☆1,431 · Updated last month
- Data Tools Subjective List ☆83 · Updated last year
- An open protocol for secure data sharing ☆840 · Updated last week
- A curated list of awesome DataOps tools ☆189 · Updated 7 months ago
- Front-end service library for Amundsen ☆280 · Updated last month
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 · Updated 3 years ago
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,226 · Updated 3 months ago
- sgr (command line client for Splitgraph) and the splitgraph Python library ☆321 · Updated last year
- Open Source Self-service Business Intelligence with Version Control ☆323 · Updated 2 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,217 · Updated this week
- Data ingestion library for Amundsen to build graph and search index ☆205 · Updated last year
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.… ☆529 · Updated last month
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,583 · Updated last week
- CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use… ☆4,720 · Updated last week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆94 · Updated last week
- General Metadata Architecture ☆125 · Updated last week
- Writes the Singer format from Python ☆563 · Updated 2 months ago