fvaleye / metadata-guardian
Provide an easy way with Python to protect your data sources by searching their metadata. 🛡️
⭐18 · Updated last month
Alternatives and similar repositories for metadata-guardian
Users who are interested in metadata-guardian are comparing it to the libraries listed below.
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ⭐57 · Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ⭐65 · Updated last week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ⭐68 · Updated 3 weeks ago
- Data Catalog for Databases and Data Warehouses ⭐35 · Updated last year
- ⭐23 · Updated last year
- dlt-dagster-demo ⭐13 · Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ⭐51 · Updated 2 years ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool. ⭐72 · Updated 3 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications ⭐45 · Updated last year
- Open Source Data Quality Monitoring. ⭐165 · Updated this week
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ⭐204 · Updated 6 months ago
- dagster scikit-learn pipeline example. ⭐46 · Updated 2 years ago
- Parse dbt artifacts and search dbt models with Algolia ⭐52 · Updated 4 years ago
- Convert monolithic Jupyter notebooks into maintainable Ploomber pipelines. ⭐79 · Updated last year
- Library of Prefect tasks and utilities. ⭐10 · Updated last year
- Data pipelines from re-usable components ⭐107 · Updated last month
- ⭐21 · Updated 4 months ago
- Pandas helper functions ⭐31 · Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator ⭐48 · Updated last year
- Playground for using large language models into the Modern Data Stack for entity matching ⭐108 · Updated 2 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ⭐10 · Updated 2 years ago
- A collection of python utility functions ⭐11 · Updated 2 months ago
- Utility functions for dbt projects running on Spark ⭐34 · Updated 3 weeks ago
- real-time data + ML pipeline ⭐53 · Updated 2 weeks ago
- Next generation compute platform for the post-modern data stack ⭐20 · Updated this week
- ✨ Build dashboards with end-to-end version control. CLI w/ batteries included, no infra required. Develop on your laptop for instant r… ⭐89 · Updated this week
- Build your feature store with macros right within your dbt repository ⭐39 · Updated 3 years ago
- A curated list of dagster code snippets for data engineers ⭐56 · Updated last year
- ODD Specification is a universal open standard for collecting metadata. ⭐145 · Updated last year
- Ibis analytics, with Ibis (and more!) ⭐23 · Updated last year