google / litmus
Litmus is a comprehensive LLM testing and evaluation tool designed for GenAI Application Development. It provides a robust platform with a user-friendly UI for streamlining the process of building and assessing the performance of your LLM-powered applications.
☆23Updated 3 weeks ago
Alternatives and similar repositories for litmus:
Users that are interested in litmus are comparing it to the libraries listed below
- applications of https://github.com/PrefectHQ/marvin☆12Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces.☆27Updated this week
- ☆26Updated last week
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …☆13Updated 5 months ago
- AI Testing Agent: Open Source AI Agent for Software Testing☆12Updated last month
- ☆1Updated 6 months ago
- ☆14Updated last year
- Tailored cloud solutions based on use case, cost, and preferences using natural language with Agentic AI to research, design, price, diag…☆29Updated 3 months ago
- ☆22Updated this week
- Generative AI Governance for Enterprises☆14Updated last month
- AI agent with RAG+ReAct on Indian Constitution & BNS☆59Updated 3 months ago
- This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/☆39Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆15Updated last year
- Perform facts checks on your conversations with LLMs to catch fake-news, misleading information, and LLMs confusion.☆13Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 7 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required☆49Updated 2 months ago
- ☆18Updated 3 months ago
- Geniusrise: Framework for building geniuses☆60Updated 8 months ago
- TLS & API keys for your LLM APIs☆15Updated last month
- Python module for running GPTScript☆14Updated last month
- ☆16Updated 8 months ago
- ☆39Updated 4 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆32Updated last week
- Pebblo enables developers to safely load data and promote their Gen AI app to deployment☆141Updated 2 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 9 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆32Updated 8 months ago
- A specification for OpenInference, a semantic mapping of ML inferences☆45Updated 9 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated last year
- Powered by SideGuide and GPT-3☆12Updated last year