Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".
☆129Jun 30, 2021Updated 4 years ago
Alternatives and similar repositories for Distill-BERT-Textgen
Users who are interested in Distill-BERT-Textgen are comparing it to the repositories listed below
- Distilling BERT using natural language generation.☆39Aug 13, 2023Updated 2 years ago
- Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (COLING 2020)☆16Mar 25, 2023Updated 2 years ago
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Oct 18, 2022Updated 3 years ago
- Code for EMNLP 2020 paper CoDIR☆41Oct 4, 2022Updated 3 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Jun 30, 2022Updated 3 years ago
- Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.☆46Aug 13, 2021Updated 4 years ago
- Simple Text Classification[WIP]☆11Dec 30, 2022Updated 3 years ago
- The official Keras implementation of ACL 2020 paper "Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-En…☆48Nov 4, 2022Updated 3 years ago
- ☆53Apr 29, 2020Updated 5 years ago
- Official Repository for "The Curious Case of Neural Text Degeneration"☆169Apr 18, 2023Updated 2 years ago
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation"☆30Apr 12, 2020Updated 5 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…☆142Dec 30, 2021Updated 4 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- INSET: Sentence Infilling with Inter-sentential Transformer☆30Nov 21, 2020Updated 5 years ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆742Jul 25, 2024Updated last year
- ☆221Jun 8, 2020Updated 5 years ago
- BERT score for text generation☆1,876Jul 30, 2024Updated last year
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆57Jan 1, 2021Updated 5 years ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering☆68Nov 26, 2021Updated 4 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆786Aug 4, 2023Updated 2 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper)☆30Jul 18, 2020Updated 5 years ago
- The source code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆67Oct 13, 2020Updated 5 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- MLPs for Vision and Language Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆302Mar 15, 2023Updated 2 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆185Jun 12, 2023Updated 2 years ago
- PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆539Nov 15, 2021Updated 4 years ago
- ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation☆99Jul 6, 2023Updated 2 years ago
- The source code of our ACL2019 paper "Incremental Transformer with Deliberation Decoder for Document Grounded Conversations"☆86Aug 30, 2019Updated 6 years ago
- Understanding the Difficulty of Training Transformers☆332May 31, 2022Updated 3 years ago
- ☆361Nov 22, 2022Updated 3 years ago
- Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆236Sep 16, 2021Updated 4 years ago
- Improving the Transformer translation model with document-level context☆170Jul 7, 2020Updated 5 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆246Sep 17, 2021Updated 4 years ago