liujch1998 / rainier
☆28 Updated 7 months ago
Related projects:
- ☆77 Updated last year
- ☆57 Updated last year
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain cause-effect relationships. It is a QA dataset containing 9000… ☆44 Updated 9 months ago
- ☆80 Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.