google-research-datasets / Disfl-QA
A Benchmark Dataset for Understanding Disfluencies in Question Answering
☆60Updated 3 years ago
Related projects: ⓘ
- ☆73Updated 3 years ago
- Build a dialog dataset from online books in many languages☆71Updated last year
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆66Updated 3 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆75Updated 3 years ago
- Temporal Commonsense Reasoning in Dialog☆69Updated 3 years ago
- ☆55Updated 2 years ago
- ☆28Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆32Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆60Updated 2 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆29Updated 4 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆71Updated 2 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆70Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆70Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 2 years ago
- ☆37Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆39Updated 5 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆63Updated 3 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22Updated 3 years ago
- Generative Retrieval Transformer☆29Updated last year
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆38Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering☆114Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- A guide to building language technology in new languages.☆57Updated 2 years ago