AlmogBaku / pytest-evalsLinks
A pytest plugin for running and analyzing LLM evaluation tests.
☆137Updated 6 months ago
Alternatives and similar repositories for pytest-evals
Users that are interested in pytest-evals are comparing it to the libraries listed below
Sorting:
- Pydantic extension for annotating autocorrecting fields.☆222Updated last year
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆79Updated last year
- Python browser sandbox.☆175Updated 4 months ago
- ☆75Updated 4 months ago
- OpenTelemetry Instrumentation for AI Observability☆533Updated this week
- Work with OpenAI's streaming API at ease with Python generators☆121Updated last year
- Transform your pythonic research to an artifact that engineers can deploy easily.☆154Updated last month
- Calculate prices for calling LLM inference APIs.☆72Updated this week
- Convert an AI Agent into a A2A server! ✨☆90Updated 3 weeks ago
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- An example to use MultiModal capabilities with Pydantic-AI to process and analyze images☆33Updated 7 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆113Updated 3 weeks ago
- HyPSTER - HyperParameter optimization on STERoids☆49Updated 8 months ago
- ☆78Updated 9 months ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆112Updated last month
- Build reliable, secure, and production-ready AI apps easily.☆80Updated 2 weeks ago
- A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.☆92Updated last week
- A Lightweight Library for AI Observability☆250Updated 5 months ago
- Promptimize is a prompt engineering evaluation and testing toolkit.☆477Updated last month
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆115Updated this week
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆230Updated this week
- An AI extension for IPython that makes it work like Cursor☆67Updated 7 months ago
- ☆157Updated 2 weeks ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆160Updated this week
- ☆173Updated last year
- Open-source versioning, tracing, and annotation tooling.☆178Updated this week
- Python SDK for running evaluations on LLM generated responses☆291Updated 2 months ago
- Product analytics for AI Assistants☆155Updated 2 months ago
- The Logfire MCP Server is here!☆98Updated this week
- A small library of LLM judges☆251Updated last week