Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"
☆11Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for ConTRoL-dataset
Users that are interested in ConTRoL-dataset are comparing it to the libraries listed below
Sorting:
- Textual Entailment Using Pytorch BERT pretrained model☆11Oct 17, 2022Updated 3 years ago
- TSQA: Tabular Scenario Based Question Answering (AAAI 2021)☆18Dec 17, 2020Updated 5 years ago
- ☆15Sep 21, 2021Updated 4 years ago
- ☆19Feb 3, 2022Updated 4 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆29Jul 24, 2023Updated 2 years ago
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆36Sep 18, 2025Updated 5 months ago
- a benchmark suite for testing logical reasoning abilities of prompt-based models☆32Nov 20, 2023Updated 2 years ago
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Modification of the original Mask/Faster R-CNN☆12Dec 13, 2020Updated 5 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model