BCG-X-Official / artkitLinks
Automated prompt-based testing and evaluation of Gen AI applications
☆155Updated 8 months ago
Alternatives and similar repositories for artkit
Users that are interested in artkit are comparing it to the libraries listed below
Sorting:
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆119Updated 3 years ago
- An open-source compliance-centered evaluation framework for Generative AI models☆170Updated this week
- Build MLOps Pipelines in Minutes☆249Updated 3 months ago
- Automated knowledge graph creation SDK☆122Updated 11 months ago
- Simple, Pythonic building blocks to evaluate LLM applications.☆243Updated this week
- ☆163Updated 8 months ago
- A tool for evaluating LLMs☆425Updated last year
- A Lightweight Library for AI Observability☆251Updated 8 months ago
- AI Verify☆36Updated last week
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆134Updated last year
- Framework for LLM evaluation, guardrails and security☆113Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆397Updated last year
- Sample notebooks and prompts for LLM evaluation☆153Updated last week
- ☆172Updated 3 weeks ago
- ☆89Updated last year
- Product analytics for AI Assistants☆155Updated 5 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated last year
- A catalog of design patterns when building generative AI applications☆219Updated last month
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆93Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆168Updated this week
- 🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…☆953Updated 11 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆83Updated last year
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆88Updated last week
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆319Updated 3 months ago
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆115Updated 3 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆310Updated 2 weeks ago
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆463Updated 10 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆240Updated 2 weeks ago