microsoft / presidio-researchLinks

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

☆245

Alternatives and similar repositories for presidio-research

Users that are interested in presidio-research are comparing it to the libraries listed below

Sorting:

Pacific-AI-Corp / langtest
Deliver safe & effective language models
☆545Updated 3 weeks ago
tomaarsen / SpanMarkerNER
SpanMarker for Named Entity Recognition
☆463Updated 10 months ago
EdyVision / pii-codex
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
☆94Updated last month
davidberenstein1957 / concise-concepts
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…
☆244Updated 2 years ago
IBM / zshot
Zero and Few shot named entity & relationships recognition
☆391Updated 2 months ago
explosion / prodigy-openai-recipes
✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
☆322Updated 2 years ago
obi-ml-public / ehr_deidentification
Robust de-identification of medical notes using transformer architectures
☆55Updated 3 years ago
fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆188Updated last year
davidberenstein1957 / classy-classification
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…
☆220Updated 10 months ago
TonicAI / tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
☆319Updated 4 months ago
NorskRegnesentral / text-anonymization-benchmark
Annotated corpus + evaluation metrics for text anonymisation
☆70Updated 3 months ago
veritas-toolkit / diagnosis-tool
☆39Updated 2 years ago
nedap / deidentify
A Python library to de-identify medical records with state-of-the-art NLP methods.
☆140Updated 2 years ago
superwise-ai / elemeta
Metafeature Extraction for Unstructured Data
☆103Updated 8 months ago
gretelai / gretel-blueprints
Public blueprints for data use cases
☆85Updated 2 months ago
Giskard-AI / awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
☆193Updated 7 months ago
jackboyla / GLiREL
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
☆246Updated 5 months ago
explosion / prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
☆501Updated last year
compl-ai / compl-ai
An open-source compliance-centered evaluation framework for Generative AI models
☆170Updated last week
TheAtticusProject / cuad
CUAD (NeurIPS 2021)
☆453Updated 2 years ago
label-sleuth / label-sleuth
Open source no-code system for text annotation and building of text classifiers
☆269Updated 5 months ago
interpretml / interpret-text
A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in …
☆429Updated last year
nestauk / ojd_daps_skills
Nesta's Skills Extractor Library
☆147Updated 5 months ago
theirstory / gliner-spacy
A spaCy wrapper for GliNER
☆124Updated 9 months ago
explosion / healthsea
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
☆92Updated 3 years ago
ieriii / spacy-annotator
Spacy NER annotator using ipywidgets
☆123Updated last year
HLasse / TextDescriptives
A Python library for calculating a large variety of metrics from text
☆353Updated 11 months ago
EthicalML / fml-security
Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…
☆119Updated 3 years ago
deepset-ai / haystack-core-integrations
Additional packages (components, document stores and the likes) to extend the capabilities of Haystack
☆169Updated this week
webis-de / small-text
Active Learning for Text Classification in Python
☆631Updated last week