fvaleye / metadata-guardianLinks
Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈ
β18Updated last week
Alternatives and similar repositories for metadata-guardian
Users that are interested in metadata-guardian are comparing it to the libraries listed below
Sorting:
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ79Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- A monorepo of many Rill example projectsβ43Updated last week
- Next generation compute platform for the post-modern data stackβ16Updated last week
- β22Updated last month
- Data Catalog for Databases and Data Warehousesβ35Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trinoβ91Updated this week
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.β10Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β63Updated last week
- β90Updated last year
- Palm CLI - the tool-belt for data teamsβ47Updated last year
- Build your feature store with macros right within your dbt repositoryβ39Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applicationsβ46Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise itβ26Updated last year
- Data pipelines from re-usable componentsβ107Updated 2 years ago
- Unity Catalog UIβ43Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data typeβ¦β60Updated this week
- π A sweet and speedy code generator for dbt ποΈβ¨β29Updated 2 months ago
- β23Updated last year
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualitβ¦β64Updated this week
- dagster scikit-learn pipeline example.β45Updated 2 years ago
- Utility functions for dbt projects running on Sparkβ33Updated 8 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.β21Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.β25Updated 2 years ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.β19Updated last year
- real-time data + ML pipelineβ54Updated last week
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.β139Updated this week
- Pandas helper functionsβ31Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learningβ44Updated last week
- Parse dbt artifacts and search dbt models with Algoliaβ52Updated 4 years ago