ddkang / loss_dropper
☆51Updated 4 years ago
Alternatives and similar repositories for loss_dropper:
Users that are interested in loss_dropper are comparing it to the libraries listed below
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Updated 4 years ago
- Source code for Text Infilling, implemented with Texar.☆27Updated 6 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆25Updated 3 years ago
- DisCo Transformer for Non-autoregressive MT☆77Updated 2 years ago
- ☆20Updated 4 years ago
- ☆41Updated 4 years ago
- ☆42Updated 4 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- Posterior Control of Blackbox Generation☆23Updated 4 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 4 years ago
- ☆12Updated 2 years ago
- Pytorch Seq2Seq framework☆26Updated 5 months ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆32Updated 3 years ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Updated 4 years ago
- ☆22Updated 3 years ago
- ☆29Updated 2 years ago
- An NMT framework built on Joint Representation☆12Updated 5 years ago
- ☆25Updated 3 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Updated 3 years ago
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated last year
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- ☆44Updated 4 years ago
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)☆30Updated 4 years ago
- ☆59Updated 2 years ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.☆24Updated 3 years ago
- Instruction to data diversification☆25Updated 4 years ago