edwardcooper / piidetectLinks
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
☆45Updated 6 years ago
Alternatives and similar repositories for piidetect
Users that are interested in piidetect are comparing it to the libraries listed below
Sorting:
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆45Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆58Updated last month
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated 3 weeks ago
- A project to build a machine learning pipeline to detect personal identifiable information (PII)☆16Updated 2 years ago
- Search for PII in Python☆29Updated last year
- ☆28Updated 4 years ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 10 months ago
- Python package for deduplication/entity resolution using active learning☆80Updated 9 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago
- ☆48Updated this week
- A simple search engine to search medium stories built with streamlit and elasticsearch.☆40Updated 3 years ago
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆217Updated this week
- ☆17Updated last year
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- ☆46Updated 2 years ago
- ☆49Updated last year
- ☆57Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆49Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Expose a Top2Vec model with a REST API.☆90Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- ☆39Updated 3 months ago
- Fast model deployment on AWS Lambda☆14Updated last year
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated last month
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆24Updated last month
- ☆43Updated 2 years ago
- ☆13Updated 3 years ago