TianduoWang / MsATLinks

[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707

☆24

Alternatives and similar repositories for MsAT

Users that are interested in MsAT are comparing it to the libraries listed below

Sorting:

GXimingLu / Quark
☆75Updated 2 years ago
ellaneeman / disent_qa
This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
☆16Updated 2 years ago
Shark-NLP / EVALM
Official codebase for “In-Context Learning with Many Demonstration Examples”
☆16Updated 2 years ago
microsoft / advNLG
☆24Updated 3 years ago
OhadRubin / EPR
☆64Updated 2 years ago
liujch1998 / rainier
☆28Updated last year
arkilpatel / SVAMP
NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?
☆135Updated 3 years ago
dqxiu / CaliNet
☆32Updated 3 years ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆61Updated last year
NJUNLP / QAlign
☆38Updated last year
matt-seb-ho / WikiWhy
WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…
☆48Updated last year
Shivanshu-Gupta / icl-coverage
☆13Updated last year
nayeon7lee / FactualityPrompt
☆86Updated 3 years ago
FranxYao / FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.
☆132Updated 2 years ago
prakharguptaz / Instructdial
Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
☆100Updated 2 years ago
GXimingLu / neurologic_decoding
☆82Updated 2 years ago
Mayer123 / UDT-QA
Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)
☆56Updated 2 years ago
KaiLv69 / UDR
ACL'23: Unified Demonstration Retriever for In-Context Learning
☆37Updated last year
HappyGu0524 / MultiControl
☆42Updated 2 years ago
Shark-NLP / CoNT
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆153Updated 2 years ago
WadeYin9712 / GeoMLAMA
☆15Updated 3 years ago
sail-sg / symbolic-instruction-tuning
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆66Updated 2 years ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118Updated last year
KwanWaiChung / M4LE
Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
☆23Updated last year
dqxiu / PLMs-with-Knowledge
☆16Updated 3 years ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63Updated last year
Alrope123 / rethinking-demonstrations
☆177Updated last year
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Updated 2 years ago
taoyds / grappa
☆31Updated 4 years ago
GAIR-NLP / alignment-for-honesty
☆76Updated last year