microsoft / presidioLinks
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
☆5,116Updated this week
Alternatives and similar repositories for presidio
Users that are interested in presidio are comparing it to the libraries listed below
Sorting:
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆228Updated last week
- AI Observability & Evaluation☆6,505Updated this week
- Adding guardrails to large language models.☆5,332Updated 2 weeks ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,613Updated this week
- The Security Toolkit for LLM Interactions☆1,889Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,253Updated last week
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆4,234Updated this week
- The LLM Evaluation Framework☆9,662Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,196Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,409Updated last month
- Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,…☆7,742Updated this week
- Evaluation and Tracking for LLM Experiments and AI Agents☆2,675Updated this week
- lakeFS - Data version control for your data lake | Git for data☆4,794Updated this week
- Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher.☆2,902Updated this week
- Harness LLMs with Multi-Agent Programming☆3,545Updated last week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.☆4,927Updated this week
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆4,949Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,587Updated 2 months ago
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆14,031Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,757Updated this week
- Postgres with GPUs for ML/AI apps.☆6,414Updated last month
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,720Updated last week
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows☆11,323Updated 2 weeks ago
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆993Updated 2 months ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,750Updated last week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,035Updated last week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆14,218Updated this week
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆4,523Updated 2 months ago
- What's in your data? Extract schema, statistics and entities from datasets☆1,503Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,754Updated this week