keanudicap / MSQALinks
Microsoft question-answering dataset
☆10Updated 2 years ago
Alternatives and similar repositories for MSQA
Users that are interested in MSQA are comparing it to the libraries listed below
Sorting:
- Repository for Decomposed Prompting☆95Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆138Updated last year
- ☆38Updated 3 years ago
- Token-level Reference-free Hallucination Detection☆96Updated 2 years ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆23Updated 3 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- ☆45Updated last year
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated last month
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆81Updated 3 months ago
- ☆88Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆83Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28Updated 2 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Updated last year
- Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency☆36Updated 9 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆64Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL☆102Updated 8 months ago
- Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"☆24Updated 5 months ago
- Code & Data for Fact-based Text Editing (Iso et al; ACL 2020)☆18Updated last year
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Updated 3 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20Updated 3 years ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆55Updated 3 weeks ago
- ☆67Updated 3 years ago
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆32Updated 4 years ago
- ☆52Updated 2 years ago
- ☆126Updated 2 years ago
- Code and data of KDD'21 paper "Table2Charts: Recommending Charts by Learning Shared Table Representations"☆33Updated 3 years ago
- ☆82Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆196Updated 2 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆84Updated 2 years ago