microsoft / presidio-researchLinks
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆236Updated last week
Alternatives and similar repositories for presidio-research
Users that are interested in presidio-research are comparing it to the libraries listed below
Sorting:
- Deliver safe & effective language models☆538Updated this week
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆89Updated 3 weeks ago
- SpanMarker for Named Entity Recognition☆451Updated 8 months ago
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆140Updated last year
- Robust de-identification of medical notes using transformer architectures☆53Updated 3 years ago
- An open-source compliance-centered evaluation framework for Generative AI models☆163Updated last week
- Zero and Few shot named entity & relationships recognition☆386Updated this week
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆163Updated this week
- Public blueprints for data use cases☆84Updated last week
- CUAD (NeurIPS 2021)☆452Updated 2 years ago
- A spaCy wrapper for GliNER☆118Updated 7 months ago
- Fiddler Auditor is a tool to evaluate language models.☆187Updated last year
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆657Updated 2 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆64Updated last month
- A Python client for the Unstructured Platform API☆106Updated this week
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆46Updated 6 years ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆318Updated 2 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Python Data Anonymization & Masking Library For Data Science Tasks☆272Updated 2 years ago
- Open source no-code system for text annotation and building of text classifiers☆265Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated last month
- 1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.☆938Updated 7 months ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark.☆93Updated 2 weeks ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆500Updated last year
- Library for clinical NLP with spaCy.☆598Updated last month
- ☆33Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Gateway into the John Snow Labs Ecosystem☆70Updated this week
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆450Updated last month