Alice1998 / URS

URS Benchmark: Evaluating LLMs on User Reported Scenarios
18 · Updated 4 months ago

Related projects

Alternatives and complementary repositories for URS