Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsuper…
☆18Jun 1, 2021Updated 4 years ago
Alternatives and similar repositories for bitext-lexind
Users that are interested in bitext-lexind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"☆14May 30, 2021Updated 4 years ago
- ☆12Jan 29, 2021Updated 5 years ago
- ☆18Nov 25, 2020Updated 5 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- ☆11Jul 28, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source Code for ACL 2020 paper, "Rationalizing Medical Relation Prediction from Corpus-level Statistics"☆11Sep 6, 2020Updated 5 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- ☆13Nov 3, 2024Updated last year
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"☆20Aug 5, 2020Updated 5 years ago
- ☆41Mar 8, 2021Updated 5 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Jun 26, 2023Updated 2 years ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 6 months ago
- Code for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)☆17Jun 16, 2022Updated 3 years ago
- CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLP☆53May 27, 2022Updated 3 years ago
- ☆15Oct 30, 2021Updated 4 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Nov 2, 2022Updated 3 years ago
- ☆14Dec 7, 2020Updated 5 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Jul 26, 2021Updated 4 years ago
- AES - Ancient Egyptian Sentences; Corpus of Ancient Egyptian sentences for corpus-linguistic research☆10May 18, 2021Updated 4 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- A public dataset containing chord/beat annotation from a music game named 'osu!'.☆11Oct 17, 2017Updated 8 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆20Jun 12, 2023Updated 2 years ago
- ☆12Mar 12, 2022Updated 4 years ago
- ☆16Apr 9, 2021Updated 5 years ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆12Oct 10, 2020Updated 5 years ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 3 years ago
- ☆10May 27, 2024Updated last year
- ☆17Nov 23, 2021Updated 4 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- An Empirical Comparison of Unsupervised Constituency Parsing Methods☆14Aug 15, 2021Updated 4 years ago
- 阿里天池智慧物流挑战赛-饿了吗新冠疫情骑士行为预估☆16Jun 3, 2020Updated 5 years ago