microsoft / presidio-researchLinks
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆238Updated last month
Alternatives and similar repositories for presidio-research
Users that are interested in presidio-research are comparing it to the libraries listed below
Sorting:
- Deliver safe & effective language models☆543Updated last week
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆91Updated last week
- Robust de-identification of medical notes using transformer architectures☆54Updated 3 years ago
- SpanMarker for Named Entity Recognition☆453Updated 9 months ago
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆141Updated last year
- Zero and Few shot named entity & relationships recognition☆388Updated 3 weeks ago
- A spaCy wrapper for GliNER☆121Updated 8 months ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆84Updated last week
- Nesta's Skills Extractor Library☆144Updated 4 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆166Updated this week
- Public blueprints for data use cases☆84Updated 3 weeks ago
- Plug-and-play, zero-shot document processing pipelines.☆107Updated this week
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆240Updated 3 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆322Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated 8 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 2 months ago
- CUAD (NeurIPS 2021)☆451Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆318Updated 3 months ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆500Updated last year
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆456Updated 2 months ago
- A Python client for the Unstructured Platform API☆105Updated this week
- Product analytics for AI Assistants☆153Updated 4 months ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆186Updated last year
- Generates synthetic data and user interfaces for privacy-preserving data sharing and analysis.☆123Updated last year
- Expose a Top2Vec model with a REST API.☆92Updated 2 years ago
- 🦦 weasel: A small and easy workflow system☆87Updated last year