eval-protocol / python-sdk
The official Python SDK for Eval Protocol
☆94 Updated last week
Alternatives and similar repositories for python-sdk
Users who are interested in python-sdk are comparing it to the libraries listed below.
- Prompt Optimization ☆73 Updated this week
- ☆274 Updated 2 weeks ago
- Eval Protocol (EP) is an open solution for doing reinforcement learning fine-tuning on existing agents — across any language, container, … ☆32 Updated last week
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆148 Updated this week
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆273 Updated 3 months ago
- Claude Deep Research config for Claude Code. ☆225 Updated 10 months ago
- A framework for optimizing DSPy programs with RL ☆308 Updated 3 weeks ago
- Provider-agnostic, open-source evaluation infrastructure for language models ☆719 Updated last month
- Letting Claude Code develop his own MCP tools :) ☆123 Updated 10 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application. ☆124 Updated 10 months ago
- Enriched Python function call graphs for agents and coding assistants ☆126 Updated 7 months ago
- This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates regularly. ☆298 Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆592 Updated last month
- Agentic Research and Evaluation Suite ☆61 Updated this week
- Context Engineering Course with DSPy ☆211 Updated 6 months ago
- Run Surfer-H agents powered by Holo1 using the Surfer-H-CLI. Includes example tasks, scripts, and configurations. ☆146 Updated last month
- ☆140 Updated 11 months ago
- Verifiers for LLM Reinforcement Learning ☆81 Updated 4 months ago
- Lightly-reviewed collection of community environments ☆210 Updated last week
- Metadspy: The framework for specifying—not programming—language models ☆88 Updated 7 months ago
- ☆162 Updated 3 months ago
- 🤖 Headless IDE for AI agents ☆200 Updated 3 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support… ☆752 Updated 8 months ago
- Deep Research for your internal data ☆352 Updated 7 months ago
- A tool kit for generating high quality prompts using DSPy GEPA optimizer ☆296 Updated last week
- ☆85 Updated 5 months ago
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows ☆151 Updated last month
- Let Claude control a web browser on your machine. ☆40 Updated 8 months ago
- FACT – Fast Augmented Context Tools: FACT is a lean retrieval pattern that skips vector search. We cache every static token inside Claude… ☆133 Updated 6 months ago
- ACP is the Agent Control Plane - a distributed agent scheduler optimized for simplicity, clarity, and control. It is designed for outer-l… ☆325 Updated 7 months ago