deep-spin / scheduled-sampling-transformers
Code for the paper "Scheduled Sampling for Transformers"
☆25Updated 5 years ago
Alternatives and similar repositories for scheduled-sampling-transformers:
Users that are interested in scheduled-sampling-transformers are comparing it to the libraries listed below
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Updated 4 years ago
- Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>☆41Updated 4 years ago
- ☆50Updated last year
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Updated 2 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆33Updated 9 months ago
- Code for the ACL 2019 paper ``A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer``☆45Updated 7 months ago
- Contrastive Attention Mechanism for Abstractive Text Summarization☆40Updated 5 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Updated 5 years ago
- HRED VHRED VHCR for Multi-Turn Dialogue Systems☆42Updated 5 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP …☆31Updated 2 years ago
- A pytorch implementation for the MMI-anti model☆33Updated 6 years ago
- semi-autoregressive neural machine translation☆23Updated 6 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆58Updated 2 years ago
- ☆38Updated 5 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆93Updated 2 years ago
- Implementation of latent-GLAT (ACL-2022)☆33Updated 2 years ago
- This is the pytorch implementation of the long paper on ACL 2020: A Self-Training Method for Machine Reading Comprehension with Soft Evid…☆35Updated 4 years ago
- ☆18Updated 7 months ago
- ☆75Updated 2 years ago
- ☆24Updated 5 years ago
- Dataset for WWW 2020 paper "Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog"☆37Updated 3 years ago
- ☆40Updated 3 years ago
- Pytorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation☆93Updated 3 years ago
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆43Updated last year
- Deep Unknown Intent Detection with Margin Loss (ACL2019)☆34Updated 2 years ago
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆115Updated 4 years ago
- Paradigm shift in natural language processing☆42Updated 2 years ago