arkilpatel / SVAMP
NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?
☆130 Updated 2 years ago
Alternatives and similar repositories for SVAMP
Users who are interested in SVAMP are comparing it to the repositories listed below.
- ☆48 Updated last year
- ☆87 Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆59 Updated last year
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707 ☆24 Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆69 Updated last year
- ☆62 Updated 2 years ago
- ☆82 Updated 2 years ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024) ☆53 Updated last month
- ☆176 Updated 11 months ago
- ☆18 Updated last year
- ACL'23: Unified Demonstration Retriever for In-Context Learning ☆38 Updated last year
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆101 Updated 2 years ago
- ☆63 Updated 2 years ago
- ☆51 Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆114 Updated 11 months ago
- ☆36 Updated 2 years ago
- ☆74 Updated last year
- Code for Graph-to-Tree Learning for Solving Math Word Problems (ACL 2020) ☆79 Updated 3 years ago
- Code base of In-Context Learning for Dialogue State tracking ☆45 Updated last year
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies". ☆74 Updated 2 years ago
- ☆28 Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆62 Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆153 Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆60 Updated 2 years ago
- ☆33 Updated last year
- A comprehensive paper list of Reasoning over Tables. ☆29 Updated 2 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering" ☆81 Updated 2 years ago
- Implementations of six Machine Learning Classifiers using only standard Python libraries such as NumPy and Pandas. ☆8 Updated 4 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆101 Updated 2 years ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data" ☆44 Updated 8 months ago