jamesmurdza / agenteval

Automated testing and benchmarking for code generation agents.
17Updated last year

Related projects

Alternatives and complementary repositories for agenteval