keanudicap / MSQALinks
Microsoft question-answering dataset
☆10Updated 2 years ago
Alternatives and similar repositories for MSQA
Users that are interested in MSQA are comparing it to the libraries listed below
Sorting:
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆138Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆74Updated 3 years ago
- Repository for Decomposed Prompting☆93Updated last year
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆48Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆97Updated last month
- ☆46Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆81Updated last year
- ☆88Updated 2 years ago
- ☆189Updated 4 months ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆23Updated 3 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆40Updated 2 years ago
- ☆38Updated 3 years ago
- Token-level Reference-free Hallucination Detection☆96Updated 2 years ago
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments☆78Updated 5 months ago
- ☆28Updated last year
- "Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020☆41Updated 4 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆85Updated 2 years ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆163Updated last year
- Code for Editing Factual Knowledge in Language Models☆142Updated 3 years ago
- Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"☆25Updated 6 months ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆196Updated 2 years ago
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Updated 4 years ago
- Code & Data for Fact-based Text Editing (Iso et al; ACL 2020)☆18Updated last year
- ☆61Updated 3 years ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆64Updated 2 years ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries"☆54Updated 5 years ago
- ☆68Updated 3 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- The codes for our ACL'22 paper: PRBOOST: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning.☆35Updated 3 years ago
- ☆19Updated 3 years ago