BCG-X-Official / artkitLinks
Automated prompt-based testing and evaluation of Gen AI applications
☆151Updated 5 months ago
Alternatives and similar repositories for artkit
Users that are interested in artkit are comparing it to the libraries listed below
Sorting:
- Fiddler Auditor is a tool to evaluate language models.☆184Updated last year
- An open-source compliance-centered evaluation framework for Generative AI models☆159Updated this week
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆115Updated 3 years ago
- A tool for evaluating LLMs☆424Updated last year
- Simple, Pythonic building blocks to evaluate LLM applications.☆233Updated 3 weeks ago
- AI Verify☆27Updated this week
- A catalog of design patterns when building generative AI applications☆179Updated this week
- ☆54Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆160Updated last week
- Product analytics for AI Assistants☆155Updated 2 months ago
- Deliver safe & effective language models☆532Updated last week
- Build MLOps Pipelines in Minutes☆247Updated last week
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆454Updated 7 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆304Updated 2 weeks ago
- Framework for LLM evaluation, guardrails and security☆113Updated 11 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆132Updated last week
- ☆87Updated last year
- 🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…☆936Updated 8 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆94Updated last year
- A Lightweight Library for AI Observability☆250Updated 5 months ago
- Sample notebooks and prompts for LLM evaluation☆138Updated 2 months ago
- ☆160Updated 6 months ago
- ☆71Updated 9 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆278Updated 3 weeks ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 10 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆225Updated this week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆469Updated 6 months ago
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆398Updated last year
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆132Updated 9 months ago