arkilpatel / SVAMP
NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?
☆126Updated 2 years ago
Alternatives and similar repositories for SVAMP
Users that are interested in SVAMP are comparing it to the libraries listed below
Sorting:
- ☆48Updated last year
- ☆63Updated 2 years ago
- ☆175Updated 9 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆130Updated last year
- ☆18Updated last year
- ☆86Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆80Updated 2 years ago
- Implementations of six Machine Learning Classifiers using only standard Python libraries such as NumPy and Pandas.☆8Updated 4 years ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆53Updated last week
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707☆24Updated last year
- MWPToolkit is an open-source framework for math word problem(MWP) solvers.☆163Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated 2 months ago
- ☆174Updated 2 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆37Updated last year
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆100Updated 2 years ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152Updated 2 years ago
- ☆82Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆137Updated 3 years ago
- ☆28Updated last year
- ☆13Updated last year
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆56Updated 2 years ago
- paper list on reasoning in NLP☆189Updated last month
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆52Updated 2 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆69Updated last year
- Code for Graph-to-Tree Learning for Solving Math Word Problems (ACL 2020)☆79Updated 3 years ago
- ☆50Updated last year
- A unified benchmark for math reasoning☆88Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL☆93Updated 3 months ago