ozyyshr / ShareGPT_investigationLinks
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))
☆13Updated last year
Alternatives and similar repositories for ShareGPT_investigation
Users that are interested in ShareGPT_investigation are comparing it to the libraries listed below
Sorting:
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An …☆26Updated last year
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".☆21Updated 3 years ago
- ☆17Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- ☆15Updated 4 years ago
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆28Updated 2 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆22Updated last year
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆22Updated 3 years ago
- Constrained Decoding Project☆17Updated last year
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Updated last year
- ☆82Updated 2 years ago
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- ☆51Updated 2 years ago
- ☆21Updated 3 years ago
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14Updated 2 years ago
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆26Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆58Updated 2 years ago
- WinoWhy provides human-annotated reasons for answering WSC questions.☆18Updated 5 years ago
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases☆29Updated 3 years ago
- ☆18Updated 4 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆42Updated 2 years ago
- ☆12Updated 2 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated 2 years ago
- ☆58Updated 3 years ago
- ☆24Updated 2 years ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆56Updated 2 years ago
- public repo for ESTER dataset and modeling (EMNLP'21)☆20Updated 3 years ago
- ☆39Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 5 months ago