tokern / piicatcher
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub.
☆310 Updated last year
Alternatives and similar repositories for piicatcher
Users who are interested in piicatcher are comparing it to the libraries listed below.
- Open source data observability platform ☆325 Updated 2 years ago
- Generate and Visualize Data Lineage from query history ☆325 Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data. ☆253 Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slack ☆305 Updated 2 years ago
- PyAirbyte brings the power of Airbyte to every Python developer. ☆262 Updated last week
- Make dbt docs and Apache Superset talk to one another ☆142 Updated 4 months ago
- Security Analytics Using The Snowflake Data Warehouse ☆183 Updated last week
- Open Control Plane for Tables in Data Lakehouse ☆350 Updated this week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe… ☆440 Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆263 Updated this week
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆180 Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊 ☆98 Updated last year
- ☆70 Updated 2 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆161 Updated 5 months ago
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄 ☆341 Updated last week
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆125 Updated 3 years ago
- Data Tools Subjective List ☆83 Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 Updated this week
- dbt-redshift contains all of the code enabling dbt to work with Amazon Redshift ☆107 Updated 3 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆235 Updated last month
- A Database Change Management tool for Snowflake ☆561 Updated 2 months ago
- Macros for calculating metrics ☆220 Updated 3 months ago
- Sample configuration to deploy a modern data platform. ☆88 Updated 3 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆147 Updated this week
- Airbyte made simple (no UI, no database, no cluster) ☆171 Updated last month
- This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform. ☆116 Updated 3 years ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit ☆151 Updated last month
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer ☆148 Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms. ☆845 Updated last month
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊 ☆736 Updated this week