nl4opt / ORQA

[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
54Updated 2 weeks ago

Alternatives and similar repositories for ORQA:

Users that are interested in ORQA are comparing it to the libraries listed below