jamesmurdza / agenteval

Automated testing and benchmarking for code generation agents.
17Updated last year

Related projects: