SuperCLUE automatic machine grading system for Gaokao (college entrance exam) essays
☆17Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for SuperCLUEgkzw
Users who are interested in SuperCLUEgkzw are comparing it to the libraries listed below
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- A native Chinese, graded benchmark for testing code capability☆15Apr 11, 2024Updated last year
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Comprehensive evaluation of the Chinese version of the open-source Llama3 model, based on the SuperCLUE benchmark | Llama3 Chinese Evaluation with SuperCLUE☆16Apr 21, 2024Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆22Sep 1, 2022Updated 3 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- benchmark of KgCLUE, with different models and methods☆28Dec 13, 2021Updated 4 years ago
- Code and Data for our EMNLP 2020 paper titled 'Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multiho…☆28Feb 9, 2022Updated 4 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆28Jun 19, 2021Updated 4 years ago
- resources for the IBM Airlines Table-Question-Answering Benchmark☆33Jul 11, 2022Updated 3 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆50Jun 30, 2025Updated 8 months ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated last year
- ☆41Nov 30, 2023Updated 2 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Nov 7, 2021Updated 4 years ago
- Efficient misspecification uncertainties for linear regression☆16Feb 19, 2026Updated 2 weeks ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated 2 months ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- ☆12Jan 11, 2026Updated last month
- something for paper agent☆11Dec 18, 2024Updated last year
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- ☆49Aug 6, 2024Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- Lucene open-domain QA retrieval in python☆11Feb 18, 2021Updated 5 years ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- ☆10Apr 17, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago