BCG-X-Official / artkitLinks
Automated prompt-based testing and evaluation of Gen AI applications
☆163Updated 10 months ago
Alternatives and similar repositories for artkit
Users that are interested in artkit are comparing it to the libraries listed below
Sorting:
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- An open-source compliance-centered evaluation framework for Generative AI models☆178Updated 2 weeks ago
- The fastest Trust Layer for AI Agents☆146Updated 7 months ago
- A tool for evaluating LLMs☆428Updated last year
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆123Updated 3 years ago
- A catalog of design patterns when building generative AI applications☆270Updated last month
- A Lightweight Library for AI Observability☆253Updated 10 months ago
- Red-Teaming Language Models with DSPy☆249Updated 10 months ago
- Framework for LLM evaluation, guardrails and security☆114Updated last year
- A small library of LLM judges☆311Updated 5 months ago
- ☆76Updated last year
- 🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…☆974Updated last year
- AI Verify☆39Updated this week
- Sample notebooks and prompts for LLM evaluation☆159Updated 2 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆398Updated 2 years ago
- Automated knowledge graph creation SDK☆122Updated last year
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆151Updated 2 years ago
- ☆262Updated last month
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆93Updated 2 years ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆246Updated last year
- ☆163Updated 11 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆295Updated 2 months ago
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆95Updated 2 weeks ago
- Simple, Pythonic building blocks to evaluate LLM applications.☆246Updated 2 months ago
- ☆54Updated last year
- ☆171Updated last month
- A curated list of awesome synthetic data tools (open source and commercial).☆231Updated 2 years ago
- Synthetic Data SDK ✨☆696Updated last week
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆744Updated 6 months ago
- Guardrails for secure and robust agent development☆378Updated 5 months ago