microsoft / presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆190Updated last week
Alternatives and similar repositories for presidio-research:
Users who are interested in presidio-research compare it to the libraries listed below.
- A research Python package for detecting, categorizing, and assessing the severity of personally identifiable information (PII)☆83Updated last year
- Robust de-identification of medical notes using transformer architectures☆50Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆43Updated 5 years ago
- Models and Pipelines for the Spark NLP library☆112Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆91Updated 3 years ago
- Fiddler Auditor is a tool to evaluate language models.☆176Updated last year
- SpanMarker for Named Entity Recognition☆421Updated 2 months ago
- A spaCy wrapper for GliNER☆108Updated last month
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆105Updated 2 years ago
- spaCy NER annotator using ipywidgets☆121Updated 11 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆191Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆104Updated 10 months ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Label data using HuggingFace's transformers and automatically get a prediction service☆184Updated last year
- A repository that showcases how you can use ZenML with Git☆69Updated 7 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆157Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago
- Clustering sentence embeddings to extract message intent☆172Updated 3 years ago
- A comprehensive reference for all topics related to building and maintaining microservices☆67Updated 2 years ago
- Deliver safe & effective language models☆517Updated this week
- 💫 spaCy wrapper for ConceptNet 💫☆90Updated last year
- ☆61Updated 4 years ago
- Metafeature Extraction for Unstructured Data☆101Updated 7 months ago
- Synthetic Data SDK ✨☆283Updated this week
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆73Updated 10 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆126Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆309Updated last year