microsoft / presidio-research
This package features data-science-related tasks for developing new recognizers for Presidio. It is used to evaluate the entire system, as well as specific PII recognizers or PII detection models.
☆257Updated 2 weeks ago
Alternatives and similar repositories for presidio-research
Users who are interested in presidio-research are comparing it to the libraries listed below.
- SpanMarker for Named Entity Recognition☆464Updated last year
- Deliver safe & effective language models☆553Updated last week
- A research Python package for detecting, categorizing, and assessing the severity of personally identifiable information (PII)☆95Updated last month
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Zero and Few shot named entity & relationships recognition☆399Updated 4 months ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- Public blueprints for data use cases☆85Updated 4 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated last week
- A Python client for the Unstructured Platform API☆112Updated this week
- An open-source compliance-centered evaluation framework for Generative AI models☆178Updated last month
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆321Updated 6 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆142Updated 2 months ago
- Robust de-identification of medical notes using transformer architectures☆57Updated 3 years ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Updated 6 years ago
- CUAD (NeurIPS 2021)☆465Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆255Updated 7 months ago
- A spaCy wrapper for GliNER☆128Updated 11 months ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆188Updated last year
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆504Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆178Updated this week
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆669Updated 7 months ago
- A (smart) rule based NLP module to extract job skills from text☆201Updated last year
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆124Updated 3 years ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆482Updated 5 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated last year
- Generates synthetic data and user interfaces for privacy-preserving data sharing and analysis.☆123Updated last year
- Open source no-code system for text annotation and building of text classifiers☆271Updated 8 months ago
- Qdrant Vector Database on Azure Cloud☆105Updated last year