tokern / dbcatLinks
Data Catalog for Databases and Data Warehouses
☆36Updated 2 years ago
Alternatives and similar repositories for dbcat
Users that are interested in dbcat are comparing it to the libraries listed below
Sorting:
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- The Data Product Descriptor Specification (DPDS) Repository☆83Updated last year
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 7 months ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆32Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆81Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- Unity Catalog UI☆43Updated last year
- Amundsen Gremlin☆22Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆80Updated this week
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- ☆35Updated 2 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated 2 years ago
- ODD Specification is a universal open standard for collecting metadata.☆146Updated last year
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 4 months ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆180Updated this week
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆43Updated 4 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆108Updated last week
- Utility functions for dbt projects running on Spark☆34Updated last month
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- Data pipelines from re-usable components☆107Updated 3 months ago
- Ibis analytics, with Ibis (and more!)☆24Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Updated 2 years ago
- Transporter for integrating OpenLineage with OpenMetadata☆15Updated 5 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆97Updated this week