fvaleye / metadata-guardianLinks
Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈ
β17Updated last month
Alternatives and similar repositories for metadata-guardian
Users that are interested in metadata-guardian are comparing it to the libraries listed below
Sorting:
- β22Updated 10 months ago
- Ibis analytics, with Ibis (and more!)β22Updated 9 months ago
- A collection of python utility functionsβ11Updated 11 months ago
- dagster scikit-learn pipeline example.β44Updated 2 years ago
- Utility functions for dbt projects running on Sparkβ34Updated 4 months ago
- IceRunner is an Apache Arrow Flight Server Implementation for Apache Iceberg Tablesβ9Updated 2 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.β10Updated 2 years ago
- scraping and querying documents for LLMsβ22Updated 3 weeks ago
- Palm CLI - the tool-belt for data teamsβ47Updated last year
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customβ¦β44Updated 11 months ago
- This extension makes vscode seamlessly work with dbt and bigqueryβ14Updated 2 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clientsβ36Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable froβ¦β27Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β59Updated last week
- Prefect integrations for working with OpenAI.β34Updated last year
- Next generation compute platform for the post-modern data stackβ15Updated last week
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data.β21Updated 4 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB workerβ¦β18Updated last year
- A software engineering framework to jump start your machine learning projectsβ37Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise itβ26Updated last year
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shouβ¦β10Updated last year
- A curated list of dagster code snippets for data engineersβ55Updated last year
- DataForge helps data teams write functional transformation pipelines by leveraging software engineering principlesβ51Updated last month
- β11Updated last year
- Batteries included toolkit for data engineering.β34Updated 5 months ago
- β32Updated 6 months ago
- Using the Parquet file format with Pythonβ15Updated last year
- Data Catalog for Databases and Data Warehousesβ35Updated last year
- A serverless duckDB deployment at GCPβ39Updated 2 years ago
- dlt-dagster-demoβ11Updated last year