magda-io / magda
A federated, open-source data catalog for all your big data and small data
☆518Updated this week
Alternatives and similar repositories for magda:
Users that are interested in magda are comparing it to the libraries listed below
- Tool to automate data quality checks on data pipelines☆253Updated 2 years ago
- ODD Specification is a universal open standard for collecting metadata.☆134Updated 2 months ago
- Generate and Visualize Data Lineage from query history☆316Updated last year
- Egeria core☆818Updated this week
- Data ingestion library for Amundsen to build graph and search index☆205Updated 10 months ago
- The metrics layer for your data. Join us at https://metriql.com/slack☆302Updated last year
- 📙 Awesome Data Catalogs and Observability Platforms.☆762Updated 5 months ago
- An Open Standard for lineage metadata collection☆1,818Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,819Updated this week
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,259Updated 3 months ago
- An open protocol for secure data sharing☆797Updated 3 weeks ago
- The premier open source Data Quality solution☆608Updated 2 months ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆102Updated 2 years ago
- 🚎 Notebook sharing hub☆496Updated last year
- Dataform is a framework for managing SQL based data operations in BigQuery☆868Updated this week
- Making DAG construction easier☆250Updated this week
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆139Updated last year
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset]☆103Updated 3 years ago
- CompassQL Query Language for visualization recommendation.☆274Updated 2 weeks ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,976Updated this week
- Front-end service library for Amundsen☆280Updated 7 months ago
- Dremio - the missing link in modern data☆1,401Updated 2 months ago
- Writes the Singer format from Python☆548Updated 3 months ago
- re_data - fix data issues before your users & CEO would discover them 😊☆1,564Updated 8 months ago
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer…☆522Updated 4 months ago
- Dremio Container Tools☆157Updated last month
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆93Updated this week
- Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data.…☆506Updated 3 weeks ago
- Docker images and Docker Compose setup for CKAN [Not Maintained]☆83Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆123Updated 3 years ago