fe1ixxu / BiBERT
This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".
☆32Updated 2 years ago
Alternatives and similar repositories for BiBERT:
Users that are interested in BiBERT are comparing it to the libraries listed below
- ☆44Updated 3 years ago
- A repository with the code related to experiments around context-aware machine translation☆48Updated 2 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆25Updated 6 months ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆81Updated last year
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Updated last year
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆60Updated 3 years ago
- ☆84Updated 2 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- ☆13Updated 3 years ago
- ☆33Updated 3 years ago
- ☆36Updated 3 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Updated last year
- ☆20Updated 2 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- ☆23Updated 2 years ago
- ☆24Updated 2 years ago
- Code base for "G-Transformer for Document-level Machine Translation"☆44Updated last year
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Updated 3 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 4 years ago
- ☆25Updated 2 years ago
- Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation☆43Updated last year
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆30Updated 2 years ago
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23Updated last year
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation"☆30Updated 4 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- code and data for paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"☆24Updated 2 years ago
- How to finetune mbart using fairseq☆21Updated 4 years ago
- ☆32Updated 2 years ago
- The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …☆62Updated 4 years ago
- ☆14Updated 2 years ago