fvaleye / metadata-guardian
Provide an easy way with Python to protect your data sources by searching their metadata. 🛡️
★17 · Updated last week
Related projects
Alternatives and complementary repositories for metadata-guardian
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ★51 · Updated last week
- Build your feature store with macros right within your dbt repository ★37 · Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ★26 · Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ★111 · Updated this week
- CLI for data platform ★19 · Updated 11 months ago
- ★26 · Updated last year
- ★21 · Updated 2 months ago
- quadipy is a Python package to help transform structured data into RDF graph format ★18 · Updated last year
- A curated list of Dagster code snippets for data engineers ★50 · Updated 8 months ago
- Data Catalog for Databases and Data Warehouses ★31 · Updated 9 months ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ★53 · Updated this week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ★10 · Updated last year
- A collection of Python utility functions ★12 · Updated 4 months ago
- Set up a Cost-Effective Modern Data Stack for a Charity ★19 · Updated 8 months ago
- S3 vector database for LLM Agents and RAG. ★29 · Updated last year
- The sane way of building a data layer in Airflow ★24 · Updated 4 years ago
- Scraping and querying documents for LLMs ★13 · Updated this week
- Examples of vector DB indexing and query with various vector databases. ★12 · Updated 3 weeks ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom… ★44 · Updated 4 months ago
- Pandas helper functions ★29 · Updated last year
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ★37 · Updated this week
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ★26 · Updated 8 months ago
- Ibis analytics, with Ibis (and more!) ★19 · Updated last month
- Awesome Orchest projects, both official and submitted by the community. ★25 · Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ★56 · Updated 2 years ago
- Batteries included toolkit for data engineering. ★32 · Updated 2 months ago
- Contains example DAGs and Terraform code to create a composer with a node pool to run pods ★13 · Updated 4 years ago
- A serverless DuckDB deployment at GCP ★35 · Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM ★41 · Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ★42 · Updated 9 months ago