MGheini/xattn-transfer-for-mt

MGheini / xattn-transfer-for-mt

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

☆33

Alternatives and similar repositories for xattn-transfer-for-mt

Users who are interested in xattn-transfer-for-mt are comparing it to the repositories listed below.

Helsinki-NLP / MuCoW
View on GitHub
Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation
☆18 · Jan 18, 2021 · Updated 5 years ago
CONE-MT / Lego-MT
View on GitHub
☆10 · Mar 22, 2024 · Updated 2 years ago
Mlair77 / nmt_adequacy
View on GitHub
☆13 · Jul 26, 2021 · Updated 5 years ago
yunsukim86 / sockeye-transfer
View on GitHub
Transfer learning for neural machine translation using cross-lingual word embeddings
☆10 · Dec 17, 2025 · Updated 7 months ago
gpengzhi / CrossConST-MT
View on GitHub
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …
☆10 · Jul 18, 2023 · Updated 3 years ago
cosmaadrian / psymo
View on GitHub
Repository for the WACV 2024 paper "PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait"
☆14 · Feb 22, 2024 · Updated 2 years ago
rbawden / mt-bigscience
View on GitHub
Evaluation results for Machine Translation within the BigScience project
☆11 · May 15, 2023 · Updated 3 years ago
NLP-Playground / LaSS
View on GitHub
☆31 · Apr 27, 2022 · Updated 4 years ago
linzehui / mRASP
View on GitHub
☆167 · Dec 24, 2021 · Updated 4 years ago
pppa2019 / swie_overmiss_llm4mt
View on GitHub
Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"
☆12 · Aug 26, 2023 · Updated 2 years ago
haorannlp / mix
View on GitHub
Code for "Mixed Cross Entropy Loss for Neural Machine Translation"
☆20 · Jul 23, 2021 · Updated 5 years ago
tmramalho / finetune-mbart
View on GitHub
How to fine-tune mBART using fairseq
☆25 · Dec 17, 2020 · Updated 5 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29 · Apr 2, 2022 · Updated 4 years ago
ImperialNLP / BertGen
View on GitHub
Training and evaluation code for the BertGen paper (ACL-IJCNLP 2021)
☆11 · Sep 17, 2023 · Updated 2 years ago
engindeniz / vitis
View on GitHub
[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
☆13 · Jan 13, 2025 · Updated last year
MichaelZhouwang / VLUE
View on GitHub
This repo contains code and instructions for baselines in the VLUE benchmark.
☆41 · Jul 16, 2022 · Updated 4 years ago
dgliu / WSDM24_MultiFS
View on GitHub
Experiment code for the WSDM '24 paper "MultiFS: Automated Multi-Scenario Feature Selection in Deep Recommender Systems"
☆11 · May 31, 2024 · Updated 2 years ago
wmt-conference / wmt22-news-systems
View on GitHub
☆21 · Feb 13, 2023 · Updated 3 years ago
microsoft / Efficient-Large-LM-Trainer
View on GitHub
☆39 · Jul 25, 2024 · Updated 2 years ago
ekinakyurek / lexical
View on GitHub
Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling
☆17 · Jan 8, 2022 · Updated 4 years ago
neulab / contextual-mt
View on GitHub
A repository with the code related to experiments around context-aware machine translation
☆51 · Sep 22, 2025 · Updated 10 months ago
songmzhang / CBMI
View on GitHub
Code for the ACL 2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation".
☆14 · Aug 6, 2022 · Updated 3 years ago
uhermjakob / utoken
View on GitHub
universal tokenizer
☆17 · Nov 29, 2021 · Updated 4 years ago
XLiu443 / Tem-adapter
View on GitHub
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37 · Oct 18, 2023 · Updated 2 years ago
Betswish / Cross-Lingual-Consistency
View on GitHub
Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (supports LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…
☆28 · Aug 8, 2025 · Updated 11 months ago
gabrielStanovsky / mt_gender
View on GitHub
☆55 · Apr 26, 2022 · Updated 4 years ago
tnq177 / witwicky
View on GitHub
Witwicky: An implementation of Transformer in PyTorch.
☆22 · Aug 17, 2020 · Updated 5 years ago
ranqiu92 / RecoverSAT
View on GitHub
☆18 · Jul 25, 2024 · Updated 2 years ago
PANXiao1994 / mRASP2
View on GitHub
☆120 · Dec 21, 2021 · Updated 4 years ago
Anagabrielamantilla / MineralProspectivityPrediction
View on GitHub
☆11 · May 4, 2024 · Updated 2 years ago
zzeng13 / DISC
View on GitHub
Automatic Idiomatic Expression Detection
☆13 · Sep 26, 2021 · Updated 4 years ago
enyac-group / Renofeation
View on GitHub
☆19 · Jun 26, 2021 · Updated 5 years ago
TARGET-SIDE-DATA-AUG / TSDASG
View on GitHub
Source Code for <Target-Side Data Augmentation for Sequence Generation>
☆12 · Oct 6, 2021 · Updated 4 years ago
yoonkim / neural-qcfg
View on GitHub
☆45 · Oct 11, 2021 · Updated 4 years ago
tnq177 / improving_lexical_choice_in_nmt
View on GitHub
☆18 · Jul 30, 2018 · Updated 7 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15 · Feb 24, 2025 · Updated last year
thunlp / MoEfication
View on GitHub
☆146 · Jul 21, 2024 · Updated 2 years ago
SZU-AdvTech-2022 / 376-HyGCN-A-GCN-Accelerator-with-Hybrid-Architecture
View on GitHub
☆12 · Mar 14, 2023 · Updated 3 years ago
cisnlp / mPLM-Sim
View on GitHub
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11 · Jan 19, 2024 · Updated 2 years ago