flageval-baai / FlagEvalLinks
FlagEval is an evaluation toolkit for AI large foundation models.
☆339Updated 9 months ago
Alternatives and similar repositories for FlagEval
Users that are interested in FlagEval are comparing it to the libraries listed below
Sorting:
- 大模型多维度中文对齐评测基准 (ACL 2024)☆421Updated 3 months ago
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆446Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.