tokern / piicatcherLinks
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
☆316Updated last year
Alternatives and similar repositories for piicatcher
Users that are interested in piicatcher are comparing it to the libraries listed below
Sorting:
- Schema modelling framework for decentralised domain-driven ownership of data.☆254Updated last year
- Generate and Visualize Data Lineage from query history☆326Updated last year
- Security Analytics Using The Snowflake Data Warehouse☆183Updated last month
- The metrics layer for your data. Join us at https://metriql.com/slack☆309Updated 2 years ago
- Open source data observability platform☆326Updated 2 years ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit☆153Updated 2 weeks ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆102Updated last week
- PyAirbyte brings the power of Airbyte to every Python developer.☆273Updated this week
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆176Updated 10 months ago
- Data Tools Subjective List☆83Updated last year
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe…☆445Updated this week
- Template for a data contract used in a data mesh.☆472Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Macros for calculating metrics☆218Updated 4 months ago
- Repository for the ActivitySchema spec and supporting materials☆419Updated 2 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,052Updated this week
- The Data Product Descriptor Specification (DPDS) Repository☆80Updated 5 months ago
- Open Control Plane for Tables in Data Lakehouse☆358Updated this week
- Airbyte made simple (no UI, no database, no cluster)☆175Updated 3 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆120Updated 4 months ago
- Serverless multi-protocol + multi-destination event collection system.☆206Updated 7 months ago
- The Data Contract Specification Repository☆355Updated 3 weeks ago
- re_data - fix data issues before your users & CEO would discover them 😊☆98Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆40Updated 10 months ago
- re_data - fix data issues before your users & CEO would discover them 😊☆1,561Updated last year
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆759Updated this week
- Apache Airflow integration for dbt☆405Updated last year
- Tool to automate data quality checks on data pipelines☆255Updated 2 years ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆262Updated 2 weeks ago