rowanz / turingadviceLinks
Evaluating Machines by their Real-World Language Use
☆33Updated 2 years ago
Alternatives and similar repositories for turingadvice
Users that are interested in turingadvice are comparing it to the libraries listed below
Sorting:
- Repository for the Question Answering via Sentence Composition (QASC) dataset☆56Updated 2 years ago
- ☆49Updated 2 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Updated 3 years ago
- ☆62Updated 3 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated last year
- FaVIQ: Fact Verification from Information-seeking Questions☆43Updated 2 years ago
- ☆49Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆92Updated 3 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Updated 2 years ago
- ☆46Updated 5 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆119Updated 3 years ago
- Repository for the CODAH dataset☆22Updated 2 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆76Updated 4 years ago
- ☆42Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Updated 4 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆98Updated 2 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning☆141Updated last year
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 4 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…☆71Updated 2 years ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering"☆122Updated 2 years ago
- ☆50Updated 3 years ago
- ☆31Updated 5 years ago
- ☆21Updated 2 years ago
- MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems☆67Updated 3 years ago
- Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk"☆79Updated 4 years ago