microsoft / presidio-researchLinks
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆240Updated last month
Alternatives and similar repositories for presidio-research
Users that are interested in presidio-research are comparing it to the libraries listed below
Sorting:
- SpanMarker for Named Entity Recognition☆460Updated 9 months ago
- Deliver safe & effective language models☆545Updated this week
- CUAD (NeurIPS 2021)☆453Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆318Updated 3 months ago
- Zero and Few shot named entity & relationships recognition☆391Updated last month
- Robust de-identification of medical notes using transformer architectures☆55Updated 3 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆322Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆94Updated 3 weeks ago
- ☆39Updated 2 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆70Updated 3 months ago
- An open-source compliance-centered evaluation framework for Generative AI models☆169Updated this week
- A Python library to de-identify medical records with state-of-the-art NLP methods.☆140Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆168Updated this week
- Metafeature Extraction for Unstructured Data☆103Updated 7 months ago
- Public blueprints for data use cases☆85Updated last month
- Haystack/OpenAI based chatbot curating a custom knowledgebase☆104Updated 2 years ago
- Generates synthetic data and user interfaces for privacy-preserving data sharing and analysis.☆123Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Active Learning for Text Classification in Python☆628Updated last week
- ☆34Updated 3 years ago
- A tool for evaluating LLMs☆425Updated last year
- A component orchestration engine☆28Updated last year
- Here you can find all the Tutorials for Haystack 📓☆342Updated last month
- Open source no-code system for text annotation and building of text classifiers☆268Updated 5 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Gateway into the John Snow Labs Ecosystem☆71Updated last week
- 📚 Datasets and models for instruction-tuning☆237Updated 2 years ago