facebookresearch / bitext-lexindLinks
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆18Updated 4 years ago
Alternatives and similar repositories for bitext-lexind
Users that are interested in bitext-lexind are comparing it to the libraries listed below
Sorting:
- Scripts to preprocess training and test data and to run fast_align and giza☆107Updated 4 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98Updated 5 years ago
- Terminology Dataset☆23Updated 5 years ago
- ☆25Updated 2 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆93Updated 7 years ago
- ☆33Updated 4 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Updated 7 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 4 years ago
- ☆18Updated 5 years ago
- NMT domain adaptation papers (updating...)☆17Updated 6 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 5 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)☆68Updated 5 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 4 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61Updated 4 years ago
- ☆50Updated 4 years ago
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczys…☆51Updated 6 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Updated 2 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 5 years ago
- ☆45Updated 4 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Updated 2 years ago
- Some good(maybe) papers about NMT (Neural Machine Translation).☆85Updated 5 years ago
- ☆120Updated 5 years ago
- ☆15Updated 4 years ago
- ☆36Updated 3 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆155Updated 3 years ago
- ☆29Updated last year
- Soft Contextual Data Augmentation☆38Updated last year
- Implementation of DTMT with incremental decoding☆13Updated 4 years ago
- Domain Adaptation of Neural Machine Translation by Lexicon Induction☆20Updated 5 years ago