microsoft / presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆190Updated last week
Alternatives and similar repositories for presidio-research:
Users that are interested in presidio-research are comparing it to the libraries listed below
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆43Updated 5 years ago
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆83Updated last year
- SpanMarker for Named Entity Recognition☆421Updated 2 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆54Updated last year
- Robust de-identification of medical notes using transformer architectures☆50Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆91Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year
- Project for open sourcing research efforts on Backward Compatibility in Machine Learning☆73Updated last year
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆105Updated 2 years ago
- Zero and Few shot named entity & relationships recognition☆360Updated 3 months ago
- Metafeature Extraction for Unstructured Data☆101Updated 7 months ago
- Spacy NER annotator using ipywidgets☆121Updated 11 months ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆182Updated 10 months ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- A spaCy wrapper for GliNER☆108Updated last month
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.☆112Updated 10 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆73Updated 10 months ago
- Deliver safe & effective language models☆517Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task☆119Updated last week
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineering☆40Updated 4 months ago
- Fiddler Auditor is a tool to evaluate language models.☆176Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆95Updated 2 months ago
- Models and Pipelines for the Spark NLP library☆112Updated 3 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- Knowledge Extraction For Forms Accelerators & Examples☆220Updated 8 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago
- Generates synthetic data and user interfaces for privacy-preserving data sharing and analysis.☆116Updated 11 months ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 8 months ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated last week