Poogles / piiregexLinks
Search for PII in Python
☆30Updated last year
Alternatives and similar repositories for piiregex
Users that are interested in piiregex are comparing it to the libraries listed below
Sorting:
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆89Updated 3 weeks ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆46Updated 6 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆62Updated this week
- Graphistry admin docs: launch, configure, use, & debug☆28Updated 2 months ago
- Train a model, and detect gibberish strings with it.☆65Updated 3 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆29Updated 2 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 4 years ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆84Updated 4 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆24Updated 3 years ago
- Common crawl extractor☆79Updated last year
- S3 vector database for LLM Agents and RAG.☆48Updated 2 years ago
- a general utility for anonymizing data☆125Updated 3 months ago
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆36Updated 7 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆45Updated 2 months ago
- A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQL☆30Updated 3 years ago
- Library for identification, anonymization and de-anonymization of PII data☆22Updated 2 years ago
- ☆76Updated 9 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆38Updated last year
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆236Updated last week
- Python port of Scramjet framework☆35Updated last year
- Now included in rigour☆151Updated last week
- An example of graph embeddings for wikipedia page recommendations☆11Updated 4 years ago
- Graph Engine for Exploration and Search☆42Updated last year
- Efficient BM25 with DuckDB 🦆☆55Updated 8 months ago
- 🖍️ Highlight text in documents☆109Updated 4 months ago
- ☆51Updated this week
- This repo walks you through how to use transfer learning to fine tune a LLM (large language model) using UK Supreme Court case law as the…☆38Updated 2 years ago
- Tools to construct and process Common Crawl webgraphs☆96Updated 3 weeks ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆321Updated last year
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards☆45Updated 2 years ago