Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆18Jun 1, 2021Updated 5 years ago
Alternatives and similar repositories for bitext-lexind
Users that are interested in bitext-lexind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"☆14May 30, 2021Updated 5 years ago
- ☆18Nov 25, 2020Updated 5 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- ☆11Jul 28, 2021Updated 4 years ago
- Source Code for ACL 2020 paper, "Rationalizing Medical Relation Prediction from Corpus-level Statistics"☆11Sep 6, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- ☆15Nov 3, 2024Updated last year
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 5 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"☆20Aug 5, 2020Updated 5 years ago
- ☆42Mar 8, 2021Updated 5 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLP☆53May 27, 2022Updated 4 years ago
- ☆15Oct 30, 2021Updated 4 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆19Sep 29, 2024Updated last year
- Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…☆10Nov 7, 2021Updated 4 years ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Nov 2, 2022Updated 3 years ago
- ☆14Dec 7, 2020Updated 5 years ago
- ☆12Nov 18, 2020Updated 5 years ago
- AES - Ancient Egyptian Sentences; Corpus of Ancient Egyptian sentences for corpus-linguistic research☆10May 18, 2021Updated 5 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 3 years ago
- Chinese Medical Dialogue Dataset for COVID19 Consultant☆18Apr 8, 2020Updated 6 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆21Jun 12, 2023Updated 3 years ago
- ☆12Mar 12, 2022Updated 4 years ago
- Evaluation of Natural Language Processing (NLP) tools for the Ancient Chinese language☆47Mar 15, 2026Updated 2 months ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 3 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- An Empirical Comparison of Unsupervised Constituency Parsing Methods☆14Aug 15, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 阿里天池智慧物流挑战赛-饿了吗新冠疫情骑士行为预估☆16Jun 3, 2020Updated 6 years ago
- Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.☆12Jun 7, 2022Updated 4 years ago
- ICLR 2022☆18Apr 15, 2022Updated 4 years ago
- ☆12Aug 31, 2021Updated 4 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆19Nov 7, 2024Updated last year
- "Learning Rhyming Constraints using Structured Adversaries. Jhamtani H., Mehta S., Carbonell J., Berg-Kirkpatrick T. EMNLP-IJCNLP (Short …☆11Mar 17, 2020Updated 6 years ago
- Filling the Gaps in Ancient Akkadian Texts:A Masked Language Modelling Approach, Lazar et al., EMNLP 2021☆14Nov 10, 2022Updated 3 years ago