tokern / dbcatLinks
Data Catalog for Databases and Data Warehouses
☆35Updated last year
Alternatives and similar repositories for dbcat
Users that are interested in dbcat are comparing it to the libraries listed below
Sorting:
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- ☆34Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- ODD Specification is a universal open standard for collecting metadata.☆146Updated last year
- The Data Product Descriptor Specification (DPDS) Repository☆82Updated last year
- Ibis analytics, with Ibis (and more!)☆24Updated last year
- dbt's adapter for dremio☆48Updated 3 years ago
- Utility functions for dbt projects running on Spark☆34Updated 3 weeks ago
- Unity Catalog UI☆43Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆78Updated this week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 2 years ago
- Data pipelines from re-usable components☆107Updated 2 months ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 6 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆178Updated last week
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆95Updated 9 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Updated 2 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆108Updated this week
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆69Updated this week
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- A platform to manage the data product life cycle☆21Updated this week
- ☆22Updated 3 weeks ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- A Table format agnostic data sharing framework☆42Updated last year