Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆18Jun 1, 2021Updated 4 years ago
Alternatives and similar repositories for bitext-lexind
Users that are interested in bitext-lexind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jan 29, 2021Updated 5 years ago
- ☆18Nov 25, 2020Updated 5 years ago
- ☆12Nov 3, 2024Updated last year
- Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"☆20Aug 5, 2020Updated 5 years ago
- ☆41Mar 8, 2021Updated 5 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLP☆53May 27, 2022Updated 3 years ago
- some python scripts for Stock and Funds☆11Sep 13, 2018Updated 7 years ago
- ☆15Oct 30, 2021Updated 4 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Nov 2, 2022Updated 3 years ago
- ☆14Dec 7, 2020Updated 5 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- ☆13Jul 26, 2021Updated 4 years ago
- ☆12Nov 18, 2020Updated 5 years ago
- AES - Ancient Egyptian Sentences; Corpus of Ancient Egyptian sentences for corpus-linguistic research☆10May 18, 2021Updated 4 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Chinese Medical Dialogue Dataset for COVID19 Consultant☆18Apr 8, 2020Updated 5 years ago
- A public dataset containing chord/beat annotation from a music game named 'osu!'.☆11Oct 17, 2017Updated 8 years ago
- ☆12Mar 12, 2022Updated 4 years ago
- ☆16Apr 9, 2021Updated 4 years ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆12Oct 10, 2020Updated 5 years ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 2 years ago
- ☆10May 27, 2024Updated last year
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- An Empirical Comparison of Unsupervised Constituency Parsing Methods☆14Aug 15, 2021Updated 4 years ago
- ☆58Jul 9, 2024Updated last year
- NMT with ssp☆11Oct 28, 2021Updated 4 years ago
- Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.☆13Jun 7, 2022Updated 3 years ago
- ☆12Aug 31, 2021Updated 4 years ago
- "Learning Rhyming Constraints using Structured Adversaries. Jhamtani H., Mehta S., Carbonell J., Berg-Kirkpatrick T. EMNLP-IJCNLP (Short …☆11Mar 17, 2020Updated 6 years ago
- uncover old chinese textual parallels based on sound☆15Feb 23, 2026Updated last month
- Filling the Gaps in Ancient Akkadian Texts:A Masked Language Modelling Approach, Lazar et al., EMNLP 2021☆13Nov 10, 2022Updated 3 years ago
- Things I care about☆13Jul 10, 2022Updated 3 years ago
- Unsupervised parallel sentence extraction from comparable corpora☆16Aug 6, 2019Updated 6 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- code for "GLEN: General-Purpose Event Detection for Thousands of Types"☆13Nov 6, 2023Updated 2 years ago