☆98Dec 5, 2023Updated 2 years ago
Alternatives and similar repositories for XiezhiBenchmark
Users who are interested in XiezhiBenchmark are comparing it to the libraries listed below.
- make LLM easier to use☆59Jul 4, 2023Updated 2 years ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Oct 12, 2023Updated 2 years ago
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆114Jun 16, 2023Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆105Jul 20, 2023Updated 2 years ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Mar 24, 2024Updated 2 years ago
- ☆96Mar 26, 2024Updated 2 years ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆139Jun 5, 2024Updated last year
- ☆772Jun 13, 2024Updated last year
- FlagEval is an evaluation toolkit for AI large foundation models.☆337Apr 24, 2025Updated last year
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset".☆32Nov 5, 2021Updated 4 years ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,839Jul 27, 2025Updated 9 months ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆56Sep 28, 2023Updated 2 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆747Jan 7, 2025Updated last year
- The official repo of INF-34B models trained by INF Technology.☆34Jul 25, 2024Updated last year
- Weekly updates of Computer Science papers uploaded to arXiv.☆105Feb 13, 2026Updated 2 months ago
- The world's most intuitive and reliable strongly-typed collaborative library☆28Feb 8, 2026Updated 2 months ago
- Interpretable Charge Predictions for Criminal Cases: Learning to Generate Court Views from Fact Descriptions☆15May 7, 2018Updated 7 years ago
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment☆1,037May 31, 2024Updated last year
- ☆21Sep 12, 2023Updated 2 years ago
- ☆164Apr 17, 2023Updated 3 years ago