Benchmark for KgCLUE, with different models and methods
☆28Dec 13, 2021Updated 4 years ago
Alternatives and similar repositories for KgCLUEbench
Users that are interested in KgCLUEbench are comparing it to the libraries listed below
- Chinese-native graded benchmark for code ability☆15Apr 11, 2024Updated last year
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Papers about the trend of Entity Linking in recent years.☆11Sep 5, 2022Updated 3 years ago
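- KgCLUE: large-scale open-source Chinese knowledge graph question answering☆454Jul 5, 2022Updated 3 years ago
- Comprehensive evaluation of the Chinese versions of the Llama3 open-source models, based on the SuperCLUE benchmark | Llama3 Chinese Evaluation with SuperCLUE☆16Apr 21, 2024Updated last year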
- ☆25Jun 19, 2024Updated last year
- Chinese machine reading comprehension dataset☆109Mar 29, 2021Updated 4 years ago
- GAOKAO-Bench-Updates is a supplement to GAOKAO-Bench, a dataset to evaluate large language models.☆39Jan 7, 2025Updated last year
- Proof in Lean of Fermat's Last Theorem for exponent 3☆41Jun 25, 2024Updated last year
- The practitioner's guide to high-speed business automation at enterprise scale using Appian☆11Jan 18, 2023Updated 3 years ago
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- ☆12Jan 11, 2026Updated last month
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- ☆12Mar 5, 2025Updated 11 months ago
- ☆12Nov 29, 2018Updated 7 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆25Dec 22, 2025Updated 2 months ago
- ☆11Oct 15, 2022Updated 3 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- An Android application for making VoIP calls over a FreeSWITCH server☆12Jun 4, 2015Updated 10 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- ☆11Aug 29, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated 2 years ago
- Automatic conversation between 2 OpenAI GPT powered characters who participate in a Turing test together.☆11Aug 9, 2023Updated 2 years ago
- Using pocketsphinx, cmuclmtk and NLTK to build a speech recognition system☆14Sep 23, 2013Updated 12 years ago
- Deep Learning for Natural Language Processing at Stanford☆13Sep 19, 2016Updated 9 years ago
- AI Music Generation group project☆12May 16, 2018Updated 7 years ago
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
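- Chinese financial LLM evaluation benchmark: 25 tasks in six categories, graded evaluation; a domestic (Chinese) model achieved an A grade☆10May 6, 2024Updated last year
- Small brainwave (EEG) project☆11Jun 11, 2019Updated 6 years ago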
- ☆12Nov 5, 2024Updated last year