babelcloud / LLM-RGB

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
143Updated this week

Alternatives and similar repositories for LLM-RGB:

Users that are interested in LLM-RGB are comparing it to the libraries listed below