alexandra-chron / relm_unmtView external linksLinks
Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".
☆35Mar 16, 2022Updated 3 years ago
Alternatives and similar repositories for relm_unmt
Users that are interested in relm_unmt are comparing it to the libraries listed below
Sorting:
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Nov 2, 2023Updated 2 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated last month
- Source code for the ACL 2019 paper "Attention-based Conditioning Methods for External Knowledge Integration"☆59Jun 21, 2022Updated 3 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆27Aug 8, 2025Updated 6 months ago
- ☆10Sep 13, 2022Updated 3 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Feb 1, 2020Updated 6 years ago
- PyTorch implementation of Variational LSTM and Monte Carlo dropout.☆56Jun 22, 2022Updated 3 years ago
- ☆15Mar 8, 2024Updated last year
- The website of the Oscar Project☆11Mar 27, 2025Updated 10 months ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Jun 12, 2021Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Jul 18, 2020Updated 5 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆20Jun 18, 2019Updated 6 years ago
- This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…☆18Sep 19, 2022Updated 3 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Dec 16, 2025Updated last month
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- Source code for "Improving Robustness of Neural Machine Translation with Multi-task Learning"☆19Aug 15, 2019Updated 6 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆386Nov 7, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- ☆25May 9, 2022Updated 3 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Sep 13, 2023Updated 2 years ago
- ☆94Feb 13, 2024Updated 2 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- ☆29Jun 5, 2022Updated 3 years ago
- Implementation of “Unsupervised Neural Machine Translation with SMT as Posterior Regularization” (AAAI 2019)☆31Mar 27, 2019Updated 6 years ago
- Tools for training pytorch language models☆27Nov 14, 2020Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Aug 13, 2020Updated 5 years ago
- ☆26Updated this week
- jiant-dev☆28Dec 17, 2020Updated 5 years ago
- Code for the EMNLP 2021 Paper "Active Learning by Acquiring Contrastive Examples" & the ACL 2022 Paper "On the Importance of Effectively …☆127May 24, 2022Updated 3 years ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021☆36May 8, 2021Updated 4 years ago
- Finite-state script normalization and processing utilities☆46Jan 14, 2026Updated 3 weeks ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- CheTo - Chemical Topic Modeling☆33Apr 12, 2021Updated 4 years ago