microsoft / presidioLinks

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

☆5,116

Alternatives and similar repositories for presidio

Users that are interested in presidio are comparing it to the libraries listed below

Sorting:

microsoft / presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…
☆228Updated last week
Arize-ai / phoenix
AI Observability & Evaluation
☆6,505Updated this week
guardrails-ai / guardrails
Adding guardrails to large language models.
☆5,332Updated 2 weeks ago
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆4,613Updated this week
protectai / llm-guard
The Security Toolkit for LLM Interactions
☆1,889Updated last week
qdrant / fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
☆2,253Updated last week
Helicone / helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
☆4,234Updated this week
confident-ai / deepeval
The LLM Evaluation Framework
☆9,662Updated this week
urchade / GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
☆2,196Updated this week
circlemind-ai / fast-graphrag
RAG that intelligently adapts to your use case, data, and queries
☆3,409Updated last month
promptfoo / promptfoo
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,…
☆7,742Updated this week
truera / trulens
Evaluation and Tracking for LLM Experiments and AI Agents
☆2,675Updated this week
treeverse / lakeFS
lakeFS - Data version control for your data lake | Git for data
☆4,794Updated this week
kuzudb / kuzu
Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher.
☆2,902Updated this week
langroid / langroid
Harness LLMs with Multi-Agent Programming
☆3,545Updated last week
NVIDIA / NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
☆4,927Updated this week
BoundaryML / baml
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
☆4,949Updated this week
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,587Updated 2 months ago
weaviate / weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…
☆14,031Updated this week
openlit / openlit
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…
☆1,757Updated this week
postgresml / postgresml
Postgres with GPUs for ML/AI apps.
☆6,414Updated last month
NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…
☆2,720Updated last week
neuml / txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
☆11,323Updated 2 weeks ago
Scale3-Labs / langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…
☆993Updated 2 months ago
apache / burr
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…
☆1,750Updated last week
Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆12,035Updated last week
langfuse / langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…
☆14,218Updated this week
adbar / trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…
☆4,523Updated 2 months ago
capitalone / DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
☆1,503Updated this week
AgentOps-AI / tokencost
Easy token price estimates for 400+ LLMs. TokenOps.
☆1,754Updated this week