Poogles / piiregexLinks
Search for PII in Python
☆29Updated last year
Alternatives and similar repositories for piiregex
Users that are interested in piiregex are comparing it to the libraries listed below
Sorting:
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆89Updated 2 years ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆82Updated 2 months ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆59Updated this week
- A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.☆43Updated last year
- Graphistry admin docs: launch, configure, use, & debug☆27Updated last week
- Neo4j Cybersecurity Demo☆18Updated 3 years ago
- Security and compliance proxy for LLM APIs☆47Updated last year
- CLK hash: hash pii for entity matching☆47Updated 2 months ago
- Library for identification, anonymization and de-anonymization of PII data☆22Updated 2 years ago
- Common crawl extractor☆78Updated last year
- semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language …☆28Updated 11 months ago
- Explore AI Supply Chain Risk with the AI Risk Database☆58Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 3 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 3 years ago
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆227Updated this week
- S3 vector database for LLM Agents and RAG.☆44Updated last year
- This repo walks you through how to use transfer learning to fine tune a LLM (large language model) using UK Supreme Court case law as the…☆36Updated last year
- Code accompanying AWS blog post "Build a Semantic Search Engine for Tabular Columns with Transformers and Amazon OpenSearch Service"☆17Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- Python implementations of record linkage blocking techniques.☆21Updated last year
- A CLI for identifying potential Personally Identifiable Information in datasets.☆13Updated 6 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated last year
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆139Updated last year
- Use Kore.ai's Knowledge Graph Generator to automatically extract terms from FAQs, define the hierarchy between these terms, and also asso…☆15Updated 2 years ago
- Train a model, and detect gibberish strings with it.☆64Updated 3 years ago
- List of Sanctions and Most wanted☆28Updated 8 years ago
- A Python library to perform NER on structured data and generate PII with Faker☆30Updated last year
- ☆49Updated last year
- Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index☆45Updated 2 months ago