facebookresearch / bitext-lexind
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆16Updated 3 years ago
Alternatives and similar repositories for bitext-lexind
Users that are interested in bitext-lexind are comparing it to the libraries listed below
Sorting:
- ☆24Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- Terminology Dataset☆23Updated 5 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆47Updated 7 years ago
- NMT domain adaptation papers (updating...)☆17Updated 5 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23Updated 3 years ago
- ☆36Updated 2 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 5 years ago
- ☆33Updated 3 years ago
- ☆28Updated 11 months ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- ☆44Updated 3 years ago
- ☆50Updated 3 years ago
- ☆37Updated 3 years ago
- ☆21Updated 2 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Updated last year
- ☆22Updated 6 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Updated 3 years ago
- Instruction to data diversification☆25Updated 4 years ago
- [ACL'21] Data for "An In-depth Study on Internal Structure of Chinese Words".☆14Updated 3 years ago
- Implementation of DTMT with incremental decoding☆13Updated 4 years ago
- ☆20Updated 4 years ago
- ☆15Updated 3 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Updated 2 years ago
- ☆20Updated 2 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆57Updated 2 years ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.☆23Updated 3 years ago