fvaleye / metadata-guardianLinks
Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈ
β18Updated last week
Alternatives and similar repositories for metadata-guardian
Users that are interested in metadata-guardian are comparing it to the libraries listed below
Sorting:
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β62Updated this week
- Data Catalog for Databases and Data Warehousesβ35Updated last year
- A monorepo of many Rill example projectsβ45Updated last week
- Parse dbt artifacts and search dbt models with Algoliaβ52Updated 4 years ago
- β¨ Build dashboards with end-to-end version control. π CLI w/ batteries included, no infra required. Develop on your laptop for instant rβ¦β84Updated last week
- β22Updated 2 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.β21Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- β23Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.β25Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learningβ44Updated this week
- β26Updated 2 years ago
- β90Updated last year
- dlt-dagster-demoβ13Updated 2 years ago
- A curated list of dagster code snippets for data engineersβ56Updated last year
- This repo contains information about DuckDB extensions found on GitHub. Refreshed dailyβ104Updated last week
- The Data Product Descriptor Specification (DPDS) Repositoryβ81Updated 9 months ago
- Playground for using large language models into the Modern Data Stack for entity matchingβ108Updated 2 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualitβ¦β65Updated 3 weeks ago
- Cloud-agnostic Python APIβ60Updated last year
- β33Updated last week
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.β10Updated 2 years ago
- Ibis analytics, with Ibis (and more!)β22Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise itβ25Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....β77Updated last week
- Data pipelines from re-usable componentsβ107Updated 2 years ago
- π A sweet and speedy code generator for dbt ποΈβ¨β30Updated 3 months ago
- A write-audit-publish implementation on a data lake without the JVMβ45Updated last year
- dagster scikit-learn pipeline example.β46Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.β30Updated 3 weeks ago