Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".
☆35Mar 16, 2022Updated 3 years ago
Alternatives and similar repositories for relm_unmt
Users that are interested in relm_unmt are comparing it to the libraries listed below
Sorting:
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.☆32Dec 27, 2022Updated 3 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 11 months ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Feb 1, 2020Updated 6 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Dec 9, 2020Updated 5 years ago
- The website of the Oscar Project☆11Mar 27, 2025Updated 11 months ago
- ☆15Mar 8, 2024Updated last year
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Jun 12, 2021Updated 4 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Code for the ACL2020 paper Character-Level Translation with Self-Attention☆31Oct 15, 2020Updated 5 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Jul 18, 2020Updated 5 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- ☆18Sep 26, 2020Updated 5 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Feb 27, 2026Updated last week
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 2 years ago
- ☆25Jan 22, 2024Updated 2 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Aug 25, 2020Updated 5 years ago
- LSTM and QRNN Language Model Toolkit for PyTorch 1.2.0!☆20Mar 2, 2020Updated 6 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 2 years ago
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Mar 5, 2022Updated 4 years ago
- LM Pretraining with PyTorch/TPU☆137Oct 24, 2019Updated 6 years ago
- Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning"☆19Aug 15, 2019Updated 6 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Sep 22, 2021Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆389Nov 7, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Sep 13, 2023Updated 2 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- Tools for training pytorch language models☆27Nov 14, 2020Updated 5 years ago
- OpenNMT Pytorch with BERT Embeddings☆24Sep 23, 2019Updated 6 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Aug 13, 2020Updated 5 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31May 11, 2020Updated 5 years ago
- Finite-state script normalization and processing utilities☆46Feb 25, 2026Updated last week
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- CheTo - Chemical Topic Modeling☆34Apr 12, 2021Updated 4 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago