jungokasai / deep-shallow
☆43Updated 4 years ago
Alternatives and similar repositories for deep-shallow:
Users that are interested in deep-shallow are comparing it to the libraries listed below
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- ☆22Updated 4 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆22Updated 3 years ago
- ☆21Updated 3 years ago
- ☆42Updated 4 years ago
- ☆21Updated 2 years ago
- ☆45Updated 3 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆15Updated 3 years ago
- ☆32Updated 3 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- DisCo Transformer for Non-autoregressive MT☆78Updated 2 years ago
- ☆29Updated 2 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Updated 3 years ago
- Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"☆46Updated 2 years ago
- Pytorch Seq2Seq framework☆26Updated 4 months ago
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Updated 4 years ago
- lanmt ebm☆11Updated 4 years ago
- ☆17Updated 2 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation☆25Updated 4 years ago
- ☆46Updated 2 years ago
- ☆14Updated 3 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"☆23Updated 5 years ago
- ☆20Updated 4 years ago
- LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference☆79Updated 3 years ago
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆55Updated 2 years ago