Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021
☆33Sep 15, 2021Updated 4 years ago
Alternatives and similar repositories for xattn-transfer-for-mt
Users interested in xattn-transfer-for-mt are comparing it to the libraries listed below
- Code for "Scheduled Sampling Based on Decoding Steps for Neural Machine Translation" (long paper of EMNLP-2022)☆20Aug 31, 2021Updated 4 years ago
- Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"☆21Jun 23, 2020Updated 5 years ago
- Repository for the WACV 2024 paper "PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait"☆13Feb 22, 2024Updated 2 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 2 months ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- ☆167Dec 24, 2021Updated 4 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- ☆39Jul 25, 2024Updated last year
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- ☆13Jul 26, 2021Updated 4 years ago
- ☆31Apr 27, 2022Updated 3 years ago
- ☆19Jun 26, 2021Updated 4 years ago
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Oct 19, 2022Updated 3 years ago
- ☆18Jul 25, 2024Updated last year
- ☆20Dec 31, 2020Updated 5 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Jul 23, 2021Updated 4 years ago
- ☆55Apr 26, 2022Updated 3 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆27Aug 8, 2025Updated 7 months ago
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 5 months ago
- ☆21Feb 13, 2023Updated 3 years ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆33Jun 9, 2024Updated last year
- How to finetune mbart using fairseq☆25Dec 17, 2020Updated 5 years ago
- 14 million, semi-supervised, mental disorder detection data.☆13Oct 23, 2024Updated last year
- The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …☆63Nov 20, 2020Updated 5 years ago
- Replication attempt for the Protein Folding Model described in https://www.biorxiv.org/content/10.1101/2021.08.02.454840v1☆36May 19, 2022Updated 3 years ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- ☆25Oct 22, 2022Updated 3 years ago
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 2 years ago
- Hierarchical Question-Image Co-Attention for Visual Question Answering☆24Jun 2, 2019Updated 6 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- ☆144Jul 21, 2024Updated last year
- ☆68Aug 29, 2024Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- Analyzing Uncertainty in Neural Machine Translation☆34Sep 15, 2021Updated 4 years ago
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Oct 6, 2023Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Jun 12, 2024Updated last year