microsoft / presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
☆3,855 Updated this week
Related projects
Alternatives and complementary repositories for presidio
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire… ☆177 Updated this week
- A lightning fast Finite State machine and REgular expression manipulation library. ☆1,835 Updated last year
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ☆4,803 Updated this week
- The Security Toolkit for LLM Interactions ☆1,251 Updated last month
- Postgres with GPUs for ML/AI apps. ☆6,039 Updated this week
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows ☆9,460 Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach… ☆4,666 Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! ☆4,797 Updated this week
- Structured Text Generation ☆9,573 Updated this week
- Adding guardrails to large language models. ☆4,150 Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆9,203 Updated this week
- The LLM Evaluation Framework ☆3,747 Updated this week
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,137 Updated 3 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,531 Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆3,997 Updated this week
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convert… ☆17,822 Updated this week
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability v… ☆6,815 Updated this week
- A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distribu… ☆4,829 Updated 2 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆2,719 Updated 3 months ago
- DSPy: The framework for programming—not prompting—language models ☆19,066 Updated this week
- Fit interpretable models. Explain blackbox machine learning. ☆6,299 Updated this week
- Chronon is a data platform for serving for AI/ML applications. ☆745 Updated this week
- What's in your data? Extract schema, statistics and entities from datasets ☆1,434 Updated last week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024 ☆1,446 Updated last week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… ☆1,850 Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,133 Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam… ☆6,654 Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,470 Updated 8 months ago
- 🦉 Data Versioning and ML Experiments ☆13,940 Updated this week
- Supercharge Your LLM Application Evaluations 🚀 ☆7,297 Updated this week