Poogles / piiregexLinks
Search for PII in Python
☆30Updated last year
Alternatives and similar repositories for piiregex
Users that are interested in piiregex are comparing it to the libraries listed below
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆48Updated 6 years ago
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆94Updated 3 weeks ago
- A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.☆44Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆62Updated this week
- Graphistry admin docs: launch, configure, use, & debug☆28Updated last week
- PyPi module for Graphlet AI Knowledge Graph Factory☆31Updated 2 years ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆83Updated 3 weeks ago
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆150Updated last year
- A CLI for identifying potential Personally Identifiable Information in datasets.☆14Updated 6 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆40Updated last year
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- Security and compliance proxy for LLM APIs☆49Updated 2 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated this week
- Neo4j Cybersecurity Demo☆17Updated 3 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆24Updated 3 years ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆326Updated last year
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆240Updated last month
- Common crawl extractor☆80Updated last year
- An open-source package for python to clean raw text data☆72Updated 2 years ago
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆64Updated 9 months ago
- Email Datasets can be found here☆72Updated 5 years ago
- S3 vector database for LLM Agents and RAG.☆48Updated 2 years ago
- ☆20Updated last year
- Graph databases, Knowledge Graphs, SPARQ☆80Updated 4 years ago
- Blazing fast fuzzy text search for Python.☆47Updated 6 months ago
- Index Common Crawl archives in tabular format☆121Updated this week
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models.☆16Updated 4 years ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆86Updated 11 months ago