ZurichNLP/multilingual-instruction-tuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZurichNLP/multilingual-instruction-tuning)

ZurichNLP / multilingual-instruction-tuning

Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"

☆26

Alternatives and similar repositories for multilingual-instruction-tuning

Users that are interested in multilingual-instruction-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fyvo / WMT-Biomed-Test
View on GitHub
☆13Aug 23, 2024Updated last year
hplt-project / OpusTrainer
View on GitHub
Curriculum training
☆22Jun 25, 2025Updated last year
ictnlp / SiLLM
View on GitHub
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…
☆18Feb 22, 2024Updated 2 years ago
cisnlp / mPLM-Sim
View on GitHub
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11Jan 19, 2024Updated 2 years ago
rbawden / mt-bigscience
View on GitHub
Evaluation results for Machine Translation within the BigScience project
☆11May 15, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZurichNLP / mbr
View on GitHub
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆61Jun 3, 2024Updated 2 years ago
ictnlp / PCFG-NAT
View on GitHub
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12Jan 4, 2024Updated 2 years ago
swiss-ai / parity-aware-bpe
View on GitHub
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆20Apr 18, 2026Updated 3 months ago
josejg / instruction_following_eval
View on GitHub
Instruction Following Eval
☆18Jan 16, 2025Updated last year
SeaEval / SeaEval
View on GitHub
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
☆26Mar 3, 2025Updated last year
VITA-Group / TAPE
View on GitHub
[ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…
☆15Jun 6, 2025Updated last year
ictnlp / ITST
View on GitHub
Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"
☆13Nov 3, 2022Updated 3 years ago
naist-nlp / mbrs
View on GitHub
A library for minimum Bayes risk (MBR) decoding
☆53Nov 2, 2025Updated 8 months ago
ahmetustun / hyperx
View on GitHub
☆21Dec 5, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Principled-Intelligence / orbitals
View on GitHub
☆17Jul 21, 2026Updated last week
OSU-STARLAB / Simul-LLM
View on GitHub
[ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.
☆18Apr 21, 2025Updated last year
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
cisnlp / Glot500
View on GitHub
[ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
☆107Apr 14, 2026Updated 3 months ago
NJUNLP / AdaR
View on GitHub
☆15Dec 8, 2025Updated 7 months ago
OpenBMB / UltraLink
View on GitHub
An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
☆27Jan 19, 2025Updated last year
steinst / SentAlign
View on GitHub
☆38Mar 16, 2026Updated 4 months ago
megagonlabs / llm-longeval
View on GitHub
💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)
☆11Feb 22, 2024Updated 2 years ago
AllanYangZhou / generative-invariance-transfer
View on GitHub
☆26Feb 27, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wmt-conference / wmt22-news-systems
View on GitHub
☆21Feb 13, 2023Updated 3 years ago
arteria / djangocms-inline-comment
View on GitHub
Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only
☆13Sep 15, 2020Updated 5 years ago
economia / DHondt
View on GitHub
Utility to compute number of mandates based on election results, uting D'Hondt method
☆11Sep 6, 2013Updated 12 years ago
zwhe99 / WMT22-En-Liv
View on GitHub
[WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian
☆23May 4, 2023Updated 3 years ago
nishantsubramani / steering_vectors
View on GitHub
Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings
☆11Mar 14, 2022Updated 4 years ago
simonZhou86 / en_dran
View on GitHub
Code for paper Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion
☆12Nov 18, 2024Updated last year
ZurichNLP / ContraDecode
View on GitHub
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆38Aug 29, 2025Updated 11 months ago
wmt-conference / wmt-format-tools
View on GitHub
Tools for formatting WMT hypothesis and test sets in XML
☆27Apr 18, 2025Updated last year
Betswish / Cross-Lingual-Consistency
View on GitHub
Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…
☆28Aug 8, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TMMMU-Benchmark / evaluation
View on GitHub
Evaluation code for benchmarking VLMs in traditional chinese understanding
☆14Dec 22, 2025Updated 7 months ago
0nutation / DUB
View on GitHub
Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)
☆29Jun 28, 2023Updated 3 years ago
huhailinguist / ChineseNLIProbing
View on GitHub
☆10Oct 17, 2021Updated 4 years ago
dhruvdcoder / xlm-core
View on GitHub
xLM is a modular, research-friendly framework for developing and comparing non-autoregressive language models. Built on PyTorch and PyTor…
☆28Updated this week
NJUNLP / MMT-LLM
View on GitHub
☆36Jun 15, 2023Updated 3 years ago
N-Almarwani / DCT_Sentence_Embedding
View on GitHub
Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform
☆17Jul 2, 2020Updated 6 years ago
microsoft / Multilingual-Evaluation-of-Generative-AI-MEGA
View on GitHub
Code for Multilingual Eval of Generative AI paper published at EMNLP 2023
☆72Mar 6, 2024Updated 2 years ago