alirezamshi-zz / small100Links

Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to EMNLP 2022.

☆23

Alternatives and similar repositories for small100

Users that are interested in small100 are comparing it to the libraries listed below

Sorting:

bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆73Updated last year
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆84Updated last year
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
lgessler / microbert
A tiny BERT for low-resource monolingual models
☆31Updated 9 months ago
juletx / self-translate
Do Multilingual Language Models Think Better in English?
☆42Updated last year
alirezamshi / small100
Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…
☆27Updated 2 years ago
amazon-science / contrastive-controlled-mt
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆21Updated 2 years ago
google-research / mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
☆110Updated 4 months ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆82Updated 10 months ago
EleanorJiang / BlonDe
Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …
☆77Updated last year
malteos / clp-transfer
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30Updated 2 years ago
machelreid / m2d2
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Updated 2 years ago
ZurichNLP / nmtscore
A library of translation-based text similarity measures
☆25Updated last year
cindyxinyiwang / expand-via-lexicon-based-adaptation
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆30Updated 3 years ago
MicrosoftTranslator / GEMBA
GEMBA — GPT Estimation Metric Based Assessment
☆119Updated 11 months ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 2 years ago
ghrua / NgramRes
☆21Updated 2 years ago
laurieburchell / open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
☆74Updated 3 months ago
thevasudevgupta / transformers-adapters
This repositary hosts my experiments for the project, I did with OffNote Labs.
☆10Updated 4 years ago
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
bltlab / mot
Multilingual Open Text
☆25Updated 2 months ago
Rojak-NLP / LLM-Code-Mixing
Can LLMs generate code-mixed sentences through zero-shot prompting?
☆11Updated 2 years ago
ZurichNLP / multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
☆25Updated last month
ZurichNLP / ContraDecode
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆35Updated last year
google-research / url-nlp
☆215Updated 2 weeks ago
CZWin32768 / XLM-Align
☆36Updated 2 years ago
jungokasai / beam_with_patience
☆46Updated 3 years ago
mbzuai-nlp / bactrian-x
A Multilingual Replicable Instruction-Following Model
☆94Updated 2 years ago
naist-nlp / mbrs
A library for minimum Bayes risk (MBR) decoding
☆43Updated last month
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆105Updated 2 weeks ago