microsoft / presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
☆200Updated last month
Alternatives and similar repositories for presidio-research:
Users that are interested in presidio-research are comparing it to the libraries listed below
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆85Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆44Updated 5 years ago
- SpanMarker for Named Entity Recognition☆426Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆120Updated this week
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 3 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Zero and Few shot named entity & relationships recognition☆366Updated this week
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆109Updated 9 months ago
- Software that makes labeling PDFs easy.☆413Updated 11 months ago
- The robust European language model benchmark.☆99Updated last week
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆206Updated 2 weeks ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆102Updated 3 weeks ago
- Explainable Zero-Shot Topic Extraction☆62Updated 8 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆297Updated 5 months ago
- A comprehensive reference for all topics related to building and maintaining microservices☆67Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 11 months ago
- Robust de-identification of medical notes using transformer architectures☆52Updated 2 years ago
- ☆46Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆144Updated this week
- CUAD (NeurIPS 2021)☆421Updated last year
- An open-source compliance-centered evaluation framework for Generative AI models☆147Updated 4 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆320Updated 5 months ago
- A Python client for the Unstructured Platform API☆98Updated this week
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆108Updated 2 years ago
- Project for open sourcing research efforts on Backward Compatibility in Machine Learning☆73Updated last year
- Spacy NER annotator using ipywidgets☆121Updated last year
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases wit…☆48Updated 3 years ago
- Sample notebooks and prompts for LLM evaluation☆124Updated last week