magda-io / magda
A federated, open-source data catalog for all your big data and small data
☆551 · Updated this week
Alternatives and similar repositories for magda
Users who are interested in magda are comparing it to the repositories listed below.
- Egeria core ☆858 · Updated this week
- ODD Specification is a universal open standard for collecting metadata. ☆142 · Updated 9 months ago
- Tool to automate data quality checks on data pipelines ☆255 · Updated 2 years ago
- 📙 Awesome Data Catalogs and Observability Platforms. ☆885 · Updated this week
- Egeria's Guidance on Governance as well as large media files such as presentations and movies ☆105 · Updated 2 years ago
- Dremio Container Tools ☆162 · Updated 3 months ago
- The premier open source Data Quality solution ☆641 · Updated 3 weeks ago
- Generate and Visualize Data Lineage from query history ☆326 · Updated 2 years ago
- Dremio - the missing link in modern data ☆1,435 · Updated 3 months ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆311 · Updated 2 years ago
- Data Pipeline Framework using the singer.io spec ☆653 · Updated this week
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ☆797 · Updated 2 years ago
- An Open Standard for lineage metadata collection ☆2,044 · Updated last week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆1,983 · Updated 2 weeks ago
- Data Tools Subjective List ☆86 · Updated last year
- Apache Superset UI packages ☆343 · Updated 3 years ago
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes ☆480 · Updated this week
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset] ☆104 · Updated 4 years ago
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business… ☆1,335 · Updated 5 months ago
- Apache DataLab (incubating) ☆152 · Updated last year
- The Data Product Descriptor Specification (DPDS) Repository ☆80 · Updated 6 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆96 · Updated 2 weeks ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies… ☆1,113 · Updated 2 years ago
- Writes the Singer format from Python ☆570 · Updated last month
- Use SQL to build ELT pipelines on a data lakehouse. ☆287 · Updated 3 years ago
- Data ingestion library for Amundsen to build graph and search index ☆205 · Updated last year
- A curated list of awesome DataOps tools ☆196 · Updated 3 weeks ago
- Dataform is a framework for managing SQL based data operations in BigQuery ☆914 · Updated this week
- Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data ☆767 · Updated this week
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container. ☆142 · Updated last year