jaketae / param-share-transformer
PyTorch implementation of Lessons on Parameter Sharing across Layers in Transformers
☆26 Updated 4 years ago
Alternatives and similar repositories for param-share-transformer
Users that are interested in param-share-transformer are comparing it to the libraries listed below
- ☆28 Updated 3 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch ☆46 Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch ☆74 Updated 2 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021 ☆48 Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆38 Updated 3 years ago
- ☆44 Updated 4 years ago
- Implementation of RealFormer using pytorch ☆100 Updated 4 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch ☆58 Updated 5 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆55 Updated last year
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>. ☆24 Updated 3 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models ☆24 Updated 3 years ago
- a large scientific paraphrase dataset for longer paraphrase generation ☆38 Updated 2 years ago
- DisCo Transformer for Non-autoregressive MT ☆77 Updated 2 years ago
- Implementation of Mixout with PyTorch ☆75 Updated 2 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation" ☆20 Updated 3 years ago
- code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples" ☆24 Updated 3 years ago
- ☆12 Updated last year
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models" ☆73 Updated 2 years ago
- Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>. ☆20 Updated 2 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training. ☆17 Updated 2 years ago
- Hard-Coded Gaussian Attention for Neural Machine Translation ☆36 Updated 2 years ago
- Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655 ☆21 Updated 11 months ago
- DSTC9 Submission ☆18 Updated 4 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021. ☆54 Updated 2 years ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data ☆56 Updated 3 years ago
- Zero -- A neural machine translation system ☆152 Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆52 Updated 2 years ago
- Systems submitted to IWSLT 2021 by the MT-UPC group. ☆14 Updated 2 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models. ☆48 Updated 3 years ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation" ☆18 Updated 5 years ago