benediktstroebl / agent-evalsLinks
☆22Updated last month
Alternatives and similar repositories for agent-evals
Users that are interested in agent-evals are comparing it to the libraries listed below
Sorting:
- Official Code Release for "Training a Generally Curious Agent"☆26Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset