jungokasai / deep-shallow
☆42Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for deep-shallow
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- ☆20Updated 3 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Updated 2 years ago
- ☆22Updated 3 years ago
- ☆21Updated 2 years ago
- ☆20Updated 2 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"☆22Updated 4 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆21Updated 4 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆24Updated 4 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference☆79Updated 3 years ago
- DisCo Transformer for Non-autoregressive MT☆78Updated 2 years ago
- lanmt ebm☆11Updated 4 years ago
- ☆46Updated 2 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆15Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆18Updated last year
- Pytorch Seq2Seq framework☆26Updated 3 weeks ago
- ☆28Updated 2 years ago
- ☆17Updated 2 years ago
- ☆42Updated 3 years ago
- ☆44Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 3 years ago
- ☆63Updated 2 years ago
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Updated 3 years ago
- A repository for experiments in quality-aware decoding☆14Updated 2 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Updated last year