ddkang / loss_dropperLinks
☆53Updated 5 years ago
Alternatives and similar repositories for loss_dropper
Users that are interested in loss_dropper are comparing it to the libraries listed below
Sorting:
- DisCo Transformer for Non-autoregressive MT☆77Updated 3 years ago
- Posterior Control of Blackbox Generation☆23Updated 5 years ago
- Source code for Text Infilling, implemented with Texar.☆27Updated 6 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 5 years ago
- ☆98Updated 3 years ago
- ☆32Updated 4 years ago
- ☆41Updated 4 years ago
- NAACL 2021 - Progressive Generation of Long Text☆82Updated 5 years ago
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".☆38Updated last year
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Updated 3 years ago
- Pytorch Seq2Seq framework☆27Updated 3 weeks ago
- Consistent dialogue generation☆16Updated 3 years ago
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)☆30Updated 5 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Updated 5 years ago
- ☆44Updated 5 years ago
- Pytorch implementation of "A Probabilistic Formulation of Unsupervised Text Style Transfer" by He. et. al. at ICLR 2020☆162Updated 3 years ago
- Domain Adaptive Text Style Transfer, EMNLP 2019☆70Updated 6 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 4 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆61Updated 3 years ago
- ☆42Updated 5 years ago
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Updated 5 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.☆20Updated 3 years ago
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Updated 4 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Updated 3 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 4 years ago
- Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-…☆35Updated last year
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"☆29Updated 4 years ago
- ☆22Updated 4 years ago
- An NMT framework built on Joint Representation☆12Updated 5 years ago
- ☆12Updated 3 years ago