MGheini / xattn-transfer-for-mt
Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021
☆27Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for xattn-transfer-for-mt
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆22Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆18Updated 2 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆53Updated 5 months ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆20Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated last year
- Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".☆21Updated 2 years ago
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021☆38Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆26Updated 5 months ago
- ☆16Updated 2 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Updated 3 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Updated last year
- [NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"☆18Updated 2 years ago
- The code for lifelong few-shot language learning☆53Updated 2 years ago
- ICLR 2022☆17Updated 2 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆29Updated last year
- ☆22Updated 3 years ago
- ☆21Updated 3 years ago
- ☆28Updated 2 years ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆36Updated last year
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Updated 2 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆45Updated 3 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆15Updated last year
- ☆15Updated 2 years ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆18Updated last year
- [EMNLP 2021] PyTorch Implementation of Contrastive Domain Adaptation for Question Answering using Limited Text Corpora☆14Updated last year
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Updated 2 years ago