ShiZhengyan / StepGameLinks
[AAAI 2022] Dataset and pytorch codes for the paper titled "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" in AAAI 2022 (Oral)
☆32Updated last year
Alternatives and similar repositories for StepGame
Users that are interested in StepGame are comparing it to the libraries listed below
Sorting:
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆72Updated 3 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆15Updated 2 years ago
- ☆50Updated last year
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆12Updated 2 years ago
- ☆87Updated 2 years ago
- ☆17Updated 4 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- ☆36Updated last year
- Codes for ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆31Updated 2 years ago
- ☆72Updated last year
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.☆10Updated 3 years ago
- ☆82Updated 2 years ago
- ☆46Updated 2 years ago
- ACL 2021☆26Updated 3 years ago
- Repository for Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS), accepted to NeurIPS 2021 Dataset…☆72Updated last year
- [ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering☆19Updated last month
- ☆128Updated last year
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆40Updated 2 years ago
- ☆20Updated 2 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Updated last year
- ☆22Updated last year
- the Pytorch implementation for our EMNLP 2021 paper "Learning Neural Templates for Recommender Dialogue System"☆31Updated 3 years ago
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An …☆26Updated last year
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆22Updated last year
- ☆45Updated last year
- Official repository for EMNLP'22 paper: Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering☆25Updated 2 years ago
- ☆35Updated 3 years ago
- "Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21☆34Updated 2 years ago
- ☆58Updated 3 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆111Updated 3 years ago