facebookresearch / bitext-lexind
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆18 · Updated 4 years ago
Alternatives and similar repositories for bitext-lexind
Users who are interested in bitext-lexind are comparing it to the libraries listed below.
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 · Updated 4 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 … ☆98 · Updated 5 years ago
- Terminology Dataset ☆23 · Updated 5 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019) ☆68 · Updated 6 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus ☆46 · Updated 7 years ago
- ☆26 · Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆61 · Updated 4 years ago
- NMT domain adaptation papers (updating...) ☆17 · Updated 6 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search ☆94 · Updated 7 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models". ☆58 · Updated 5 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory ☆82 · Updated 2 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, etc. ☆45 · Updated 3 years ago
- ☆33 · Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆40 · Updated 5 years ago
- ☆18 · Updated 5 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal… ☆81 · Updated 4 years ago
- Soft Contextual Data Augmentation ☆39 · Updated last year
- ☆29 · Updated last year
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczys… ☆51 · Updated 6 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020" ☆23 · Updated 4 years ago
- ☆36 · Updated 3 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems. ☆157 · Updated 3 years ago
- Record my paper reading about Machine Translation and other related works. ☆36 · Updated 4 years ago
- ☆45 · Updated 4 years ago
- ☆120 · Updated 5 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction" ☆139 · Updated 5 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models. ☆92 · Updated 2 years ago
- ☆50 · Updated 4 years ago
- Convert Standard M2 format to parallel sentences. ☆22 · Updated 5 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance" ☆25 · Updated 3 years ago