Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021
☆61May 10, 2021Updated 4 years ago
Alternatives and similar repositories for Mask-Align
Users that are interested in Mask-Align are comparing it to the libraries listed below
Sorting:
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆19Feb 19, 2023Updated 3 years ago
- ☆23Nov 15, 2022Updated 3 years ago
- Learn Classical Statistical Machine Translation Systems.☆18May 27, 2020Updated 5 years ago
- ☆21May 30, 2022Updated 3 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Jul 16, 2021Updated 4 years ago
- ☆14Aug 6, 2022Updated 3 years ago
- A accurate multilingual word aligner based on LaBSE☆24Oct 25, 2023Updated 2 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 3 months ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Jun 12, 2023Updated 2 years ago
- A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.☆23Sep 29, 2024Updated last year
- Simple, fast unsupervised word aligner☆767Jul 19, 2022Updated 3 years ago
- ☆38Jun 3, 2021Updated 4 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- Python library to use Google Transliterate API which powers the G Input Tools☆22Mar 4, 2021Updated 4 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Dec 14, 2022Updated 3 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Data Augmentation for Neural Machine Translation☆32Nov 8, 2017Updated 8 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- ☆25Oct 22, 2022Updated 3 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 10 months ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆32Jul 16, 2022Updated 3 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- ☆11Jul 17, 2021Updated 4 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- Learning to Copy for Automatic Post-Editing (EMNLP 2019)☆11May 6, 2021Updated 4 years ago
- ☆120Dec 21, 2021Updated 4 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 4 years ago
- Post-editing Datasets by Rakuten (PEDRa)☆14Jun 23, 2021Updated 4 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- A simple, Python-based, command-line runner for MGIZA++.☆10Mar 24, 2022Updated 3 years ago
- This code helps to retrieve all papers from conferences and rank them by the number of (Google Scholar) citations.☆12Dec 12, 2021Updated 4 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago