tdopierre / ProtAugment
Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning
☆21 Updated 2 years ago
Related projects:
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 Updated 2 years ago
- Training T5 to perform numerical reasoning. ☆23 Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization" ☆64 Updated 3 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021 ☆27 Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 Updated 2 years ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization" ☆16 Updated 2 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021) ☆71 Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆41 Updated last month
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021) ☆52 Updated 2 years ago
- This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Ret… ☆69 Updated 2 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆28 Updated 2 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆19 Updated 3 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models' ☆17 Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆74 Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆31 Updated last year
- Pre-training BART in Flax on The Pile dataset ☆20 Updated 3 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense. ☆79 Updated 3 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆24 Updated 11 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆71 Updated 2 years ago