google / litmus
Litmus is a comprehensive LLM testing and evaluation tool designed for GenAI application development. It provides a platform with a user-friendly UI that streamlines building your LLM-powered applications and assessing their performance.
☆31, updated last month
Alternatives and similar repositories for litmus:
Users who are interested in litmus are comparing it to the libraries listed below.
- Security and compliance proxy for LLM APIs ☆46, updated last year
- Generate Tools and Toolkits from any Python SDK -- no extra code required ☆50, updated 5 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10, updated last year
- Automated Quality Control for Dialogflow CX Agents ☆14, updated 11 months ago
- ☆28, updated 3 weeks ago
- ☆1, updated 9 months ago
- ☆37, updated last week
- Pebblo enables developers to safely load data and promote their Gen AI app to deployment ☆141, updated last month
- applications of https://github.com/PrefectHQ/marvin ☆12, updated last year
- Generative AI Governance for Enterprises ☆16, updated 3 months ago
- A specification for OpenInference, a semantic mapping of ML inferences ☆46, updated last year
- 😎 Awesome list of resources about using and building AI software development systems ☆110, updated 11 months ago
- ☆12, updated 2 years ago
- Natural Language Interfaces Powered by LLMs ☆90, updated 8 months ago
- Demos of some issues with LangChain. ☆31, updated last year
- augini: AI-Powered Tabular Data Assistant ☆28, updated last month
- DevOps AI Assistant CLI. Ask questions about your AWS services, cloudwatch metrics, and billing. ☆69, updated 8 months ago
- Reference architecture for LLM-based applications on Google Cloud Platform with Redis Enterprise as a high-performance data layer. ☆32, updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆34, updated this week
- Streamlit app for recommending eval functions using prompt diffs ☆27, updated last year
- ☆31, updated last year
- CCCS security control profiles expressed using OSCAL ☆12, updated 2 months ago
- A curated list of resources about all things Gemini in Google Cloud. ☆72, updated 3 months ago
- ☆19, updated 6 months ago
- ☆10, updated 6 months ago
- LLM plugin for models hosted by Anyscale Endpoints ☆33, updated 11 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I… ☆22, updated 2 years ago
- Retrieval Augmented Generation applications ☆26, updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆37, updated last year
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ☆33, updated last week