microsoft / presidio-researchLinks
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆250Updated this week
Alternatives and similar repositories for presidio-research
Users that are interested in presidio-research are comparing it to the libraries listed below
Sorting:
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆94Updated 2 months ago
- SpanMarker for Named Entity Recognition☆462Updated 11 months ago
- Deliver safe & effective language models☆547Updated last month
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- Public blueprints for data use cases☆85Updated 3 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆319Updated 5 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- A spaCy wrapper for GliNER☆125Updated 10 months ago
- Robust de-identification of medical notes using transformer architectures☆56Updated 3 years ago
- 📚 A curated list of papers & technical articles on AI Quality & Safety☆194Updated 7 months ago
- CUAD (NeurIPS 2021)☆463Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆173Updated this week
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Sample notebooks and prompts for LLM evaluation☆156Updated last month
- A Python client for the Unstructured Platform API☆109Updated this week
- Zero and Few shot named entity & relationships recognition☆394Updated 2 months ago
- An open-source compliance-centered evaluation framework for Generative AI models☆174Updated this week
- Synthetic Data SDK ✨☆690Updated last week
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 4 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆118Updated 8 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆820Updated 9 months ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆669Updated 5 months ago
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆141Updated 3 weeks ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated 4 months ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated 10 months ago
- Generalist and Lightweight Model for Text Classification☆166Updated last week
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆48Updated 6 years ago
- Unified Schema-Based Information Extraction☆355Updated this week
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Knowledge Extraction For Forms Accelerators & Examples☆222Updated last year