AlmogBaku / pytest-evalsLinks
A pytest plugin for running and analyzing LLM evaluation tests.
☆123Updated 4 months ago
Alternatives and similar repositories for pytest-evals
Users that are interested in pytest-evals are comparing it to the libraries listed below
Sorting:
- A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.☆49Updated this week
- Work with OpenAI's streaming API at ease with Python generators☆121Updated last year
- Transform your pythonic research to an artifact that engineers can deploy easily.☆153Updated 2 months ago
- HyPSTER - HyperParameter optimization on STERoids☆48Updated 6 months ago
- ☆10Updated 9 months ago
- A documentation assistant leveraging Model Context Protocol (MCP) to help programmers access the most up-to-date and relevant information…☆19Updated 2 months ago
- Make your GenAI Apps Safe & Secure Test & harden your system prompt☆486Updated 7 months ago
- A Lightweight Library for AI Observability☆243Updated 3 months ago
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated 10 months ago
- Self Support ChatBot☆16Updated 2 months ago
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆182Updated last week
- Hebrew oriented NER spaCy pipeline☆17Updated 9 months ago
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆248Updated this week
- A tiny LLM Agent with minimal dependencies, focused on local inference.☆52Updated 7 months ago
- A small library of LLM judges☆205Updated 2 weeks ago
- A plugin-based gateway that orchestrates other MCPs and allows developers to build upon it enterprise-grade agents.☆183Updated last month
- A powerful AI observability framework that provides comprehensive insights into agent interactions across platforms, enabling developers …☆81Updated 3 weeks ago
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included☆97Updated last week
- Open-source AI copilot that lets you chat with your observability data and code 🧙♂️☆351Updated last month
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆100Updated last month
- Record your service operations in production and replay them locally at any time in a sandbox☆106Updated 4 months ago
- ☆72Updated 7 months ago
- Metafeature Extraction for Unstructured Data☆101Updated 2 months ago
- Run evals using LLM☆25Updated last year
- Transform any OpenAPI/Swagger definition into a fully-featured Model Context Protocol (MCP) server☆141Updated this week
- Named Entity Recognition using Claude Citations☆74Updated 2 months ago
- 🚀 A list of Haystack Integrations, maintained by the community or deepset.☆89Updated this week
- Enriched Python function call graphs for agents and coding assistants☆96Updated last week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆37Updated last week
- Test Generation for Prompts☆91Updated last week