chaochun / nlu-asdiv-datasetView external linksLinks
☆52Jul 4, 2023Updated 2 years ago
Alternatives and similar repositories for nlu-asdiv-dataset
Users that are interested in nlu-asdiv-dataset are comparing it to the libraries listed below
Sorting:
- NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?☆139Jun 30, 2022Updated 3 years ago
- Code for MAWPS: A Math Word Problem Repository☆41Mar 23, 2023Updated 2 years ago
- A algebraic word problem dataset, with multiple choice questions annotated with rationales.☆331Nov 2, 2017Updated 8 years ago
- Code for Graph-to-Tree Learning for Solving Math Word Problems (ACL 2020)☆79Sep 2, 2021Updated 4 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆14May 7, 2025Updated 9 months ago
- ☆33Jan 25, 2026Updated 3 weeks ago
- ☆24Nov 20, 2025Updated 2 months ago
- Python library to add support for embedding natural code in Python with shared program state.☆23Jan 20, 2026Updated 3 weeks ago
- Math-aware QA system☆18Dec 17, 2022Updated 3 years ago
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆64Feb 13, 2023Updated 3 years ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆17Jan 18, 2025Updated last year
- A list of awesome machine question answering dataset - 機器問答數據集☆15Dec 24, 2019Updated 6 years ago
- Code for Teacher-Student Networks with Multiple Decoders for Solving Math Word Problem (IJCAI 2020).☆11Sep 19, 2020Updated 5 years ago
- ☆52Updated this week
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15May 14, 2023Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated last year
- The data for the CRASS-benchmark☆16Oct 24, 2022Updated 3 years ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆111May 22, 2025Updated 8 months ago
- ☆1,390Jan 21, 2024Updated 2 years ago
- EMNLP'19: "Structuring Latent Spaces for Stylized Response Generation"☆63May 16, 2020Updated 5 years ago
- Code for the paper ``Text2Math: End-to-end Parsing Text into Math Expressions" accepted by EMNLP 2019☆16Aug 20, 2019Updated 6 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- SuperCLUE高考作文机器自动阅卷系统☆17Jun 8, 2023Updated 2 years ago
- MWPToolkit is an open-source framework for math word problem(MWP) solvers.☆165Sep 28, 2022Updated 3 years ago
- ☆20Nov 3, 2024Updated last year
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Aug 13, 2025Updated 6 months ago
- A prompt defence is a multi-layer defence that can be used to protect your applications against prompt injection attacks.☆21Dec 12, 2025Updated 2 months ago
- A unified benchmark for math reasoning☆89Jan 25, 2023Updated 3 years ago
- AFlow & MathAI☆19Feb 24, 2025Updated 11 months ago
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆21Feb 29, 2024Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Jun 18, 2023Updated 2 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆54Sep 3, 2024Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆63Mar 26, 2024Updated last year