facebookresearch / bitext-lexind
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for bitext-lexind
- ☆24Updated last year
- ☆32Updated 3 years ago
- ☆44Updated 3 years ago
- ☆20Updated 3 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆35Updated 3 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆59Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆81Updated last year
- Domain Adaptation of Neural Machine Translation by Lexicon Induction☆20Updated 4 years ago
- Instruction to data diversification☆24Updated 3 years ago
- ☆16Updated 4 years ago
- Code base for "G-Transformer for Document-level Machine Translation"☆43Updated last year
- Tools for formatting WMT hypothesis and test sets in XML☆25Updated 3 months ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Updated 2 years ago
- ☆20Updated last year
- ☆15Updated 3 years ago
- Terminology Dataset☆23Updated 4 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated last year
- Implementation of DTMT with incremental decoding☆13Updated 3 years ago
- ☆28Updated 4 months ago
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆60Updated 3 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆45Updated 6 years ago
- ☆50Updated 3 years ago
- NMT domain adaptation papers (updating...)☆17Updated 5 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆109Updated 3 years ago
- [EMNLP 2020] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading☆36Updated last year