microsoft / presidio-researchLinks
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆231Updated last week
Alternatives and similar repositories for presidio-research
Users that are interested in presidio-research are comparing it to the libraries listed below
Sorting:
- SpanMarker for Named Entity Recognition☆447Updated 7 months ago
- Deliver safe & effective language models☆533Updated this week
- Fiddler Auditor is a tool to evaluate language models.☆185Updated last year
- Zero and Few shot named entity & relationships recognition☆384Updated 3 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Public blueprints for data use cases☆83Updated last week
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆89Updated 2 years ago
- Robust de-identification of medical notes using transformer architectures☆53Updated 3 years ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆317Updated last month
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆139Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆161Updated last week
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Open source no-code system for text annotation and building of text classifiers☆265Updated 2 months ago
- Metafeature Extraction for Unstructured Data☆102Updated 5 months ago
- ☆33Updated 3 years ago
- ☆39Updated 2 years ago
- CUAD (NeurIPS 2021)☆446Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆219Updated 7 months ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆184Updated last year
- A spaCy wrapper for GliNER☆119Updated 6 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆83Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆125Updated 3 weeks ago
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆115Updated 3 years ago
- Pebblo enables developers to safely load data and promote their Gen AI app to deployment☆147Updated 2 months ago
- A Python client for the Unstructured Platform API☆106Updated last week
- Product analytics for AI Assistants☆154Updated 2 months ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆498Updated last year