facebookresearch / bitext-lexind
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆16Updated 3 years ago
Alternatives and similar repositories for bitext-lexind:
Users that are interested in bitext-lexind are comparing it to the libraries listed below
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Updated 7 years ago
- ☆44Updated 3 years ago
- ☆24Updated 2 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆81Updated last year
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆60Updated 3 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- NMT domain adaptation papers (updating...)☆17Updated 5 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- ☆21Updated 2 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆36Updated 2 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Updated 2 years ago
- ☆28Updated 9 months ago
- Implementation of DTMT with incremental decoding☆13Updated 4 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- Terminology Dataset☆23Updated 5 years ago
- ☆33Updated 3 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆58Updated 2 years ago
- A repository with the code related to experiments around context-aware machine translation☆48Updated 2 years ago
- ☆13Updated 3 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Updated last year
- ☆20Updated 2 years ago
- ☆37Updated 3 years ago
- Record my paper reading about Machine Translation and other related works.☆36Updated 3 years ago
- ☆20Updated 4 years ago
- Source code for the AAAI 2020 long paper <Modeling Fluency and Faithfulness for Diverse Neural Machine Translation>.☆19Updated 5 years ago
- Instruction to data diversification☆25Updated 4 years ago