tokern / piicatcherLinks
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
☆328Updated last year
Alternatives and similar repositories for piicatcher
Users that are interested in piicatcher are comparing it to the libraries listed below
Sorting:
- Generate and Visualize Data Lineage from query history☆326Updated 2 years ago
- Sample configuration to deploy a modern data platform.☆89Updated 3 years ago
- Security Analytics Using The Snowflake Data Warehouse☆184Updated last week
- Open source data observability platform☆327Updated 3 years ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit☆158Updated last month
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄☆354Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated 2 years ago
- Tool to automate data quality checks on data pipelines☆256Updated 3 years ago
- ODD Specification is a universal open standard for collecting metadata.☆145Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slack☆319Updated 2 years ago
- Data Tools Subjective List☆88Updated 2 years ago
- The Privacy Engineering & Compliance Framework☆429Updated this week
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆106Updated last month
- Data Product Portal created by Dataminded☆196Updated last week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆176Updated 3 weeks ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆267Updated 8 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆168Updated 3 months ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Make dbt docs and Apache Superset talk to one another☆154Updated 2 months ago
- ☆81Updated 9 months ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆81Updated this week
- Data Pipeline Framework using the singer.io spec☆656Updated last week
- Open Control Plane for Tables in Data Lakehouse☆375Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms.☆952Updated 4 months ago
- PyAirbyte brings the power of Airbyte to every Python developer.☆313Updated this week
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆114Updated this week
- Making DAG construction easier☆281Updated 2 months ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆159Updated 3 years ago
- re_data - fix data issues before your users & CEO would discover them 😊☆101Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆48Updated 6 years ago