AlmogBaku / pytest-evalsLinks
A pytest plugin for running and analyzing LLM evaluation tests.
☆140Updated 7 months ago
Alternatives and similar repositories for pytest-evals
Users that are interested in pytest-evals are comparing it to the libraries listed below
Sorting:
- Convert an AI Agent into a A2A server! ✨☆113Updated 2 months ago
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆79Updated last week
- Python browser sandbox.☆177Updated 5 months ago
- Pydantic extension for annotating autocorrecting fields.☆222Updated last year
- Work with OpenAI's streaming API at ease with Python generators☆122Updated last year
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆260Updated this week
- OpenTelemetry Instrumentation for AI Observability☆598Updated this week
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆132Updated this week
- ☆82Updated 10 months ago
- ☆74Updated 5 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆115Updated 2 months ago
- Calculate prices for calling LLM inference APIs.☆91Updated last week
- Claudette is Claude's friend☆276Updated 2 weeks ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆163Updated this week
- A fun party trick to run Python code from another venv into this one.☆203Updated 6 months ago
- ☆176Updated last month
- A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.☆104Updated last week
- OpenAI powered AI CLI in just a few lines of code.☆124Updated last year
- HyPSTER - Configuration Framework for Optimizing AI & AI Systems☆52Updated 2 weeks ago
- A small library of LLM judges☆282Updated last month
- An AI extension for IPython that makes it work like Cursor☆67Updated 8 months ago
- The Logfire MCP Server is here!☆107Updated last month
- Jambo - JSON Schema to Pydantic Converter☆59Updated this week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆325Updated last week
- Transform your pythonic research to an artifact that engineers can deploy easily.☆154Updated 3 months ago
- MCP tools for Roaming RAG☆54Updated 3 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated 11 months ago
- ☆46Updated this week
- Promptimize is a prompt engineering evaluation and testing toolkit.☆480Updated last month
- ☆178Updated this week