opendatadiscovery / awesome-data-catalogs
Awesome Data Catalogs and Observability Platforms.
☆989 · Aug 14, 2025 · Updated 6 months ago
Alternatives and similar repositories for awesome-data-catalogs
Users who are interested in awesome-data-catalogs compare it to the libraries listed below.
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business… ☆1,378 · Jan 15, 2026 · Updated last month
- ODD Specification is a universal open standard for collecting metadata. ☆146 · Oct 28, 2024 · Updated last year
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep… ☆8,674 · Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆2,119 · Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host… ☆2,249 · Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,738 · Feb 9, 2026 · Updated last week
- The Metadata Platform for your Data and AI Stack ☆11,577 · Updated this week
- Open-source metadata collector based on ODD Specification ☆44 · Nov 6, 2023 · Updated 2 years ago
- Data Contracts engine for the modern data stack. https://www.soda.io ☆2,288 · Updated this week
- An Open Standard for lineage metadata collection ☆2,304 · Feb 8, 2026 · Updated last week
- A curated list of awesome dbt resources ☆1,638 · Feb 5, 2026 · Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt. ☆2,891 · Updated this week
- Compare tables within or across databases ☆2,990 · May 17, 2024 · Updated last year
- Egeria core ☆894 · Feb 4, 2026 · Updated last week
- Always know what to expect from your data. ☆11,133 · Updated this week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆81 · Feb 5, 2026 · Updated last week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… ☆2,341 · Feb 9, 2026 · Updated last week
- Work with your web service, database, and streaming schemas in a single format. ☆350 · Dec 30, 2025 · Updated last month
- Build, run, and manage data pipelines for integrating and transforming data. ☆8,645 · Updated this week
- A curated list of awesome DataOps tools ☆225 · Dec 10, 2025 · Updated 2 months ago
- data load tool (dlt) is an open source Python library that makes data loading easy ☆4,903 · Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆12,224 · Feb 9, 2026 · Updated last week
- An orchestration platform for the development, production, and observation of data assets. ☆14,930 · Feb 9, 2026 · Updated last week
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tab… ☆486 · Updated this week
- A curated list of awesome DuckDB resources ☆2,266 · Feb 4, 2026 · Updated last week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,413 · Updated this week
- A federated, open-source data catalog for all your big data and small data ☆584 · Feb 5, 2026 · Updated last week
- Python SQL Parser and Transpiler ☆8,906 · Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak… ☆20,668 · Feb 9, 2026 · Updated last week
- Minimalist SQL orchestrator ☆302 · Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 · Dec 5, 2023 · Updated 2 years ago
- re_data - fix data issues before your users & CEO would discover them ☆1,569 · Apr 30, 2024 · Updated last year
- dbt adapter for DuckDB ☆1,226 · Feb 9, 2026 · Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code. ☆1,480 · Updated this week
- Self-serve BI to 10x your data team ☆5,546 · Updated this week
- Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code ☆1,137 · Updated this week
- lakeFS - Data version control for your data lake | Git for data ☆5,141 · Feb 9, 2026 · Updated last week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ☆856 · Apr 5, 2024 · Updated last year
- The stupidly simple CLI workspace for your data warehouse. ☆728 · Feb 8, 2023 · Updated 3 years ago