Alice1998 / URS

URS Benchmark: Evaluating LLMs on User Reported Scenarios
21Updated this week

Related projects

Alternatives and complementary repositories for URS