ZurichNLP / multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
☆26Jun 3, 2025Updated 8 months ago
Alternatives and similar repositories for multilingual-instruction-tuning
Users who are interested in multilingual-instruction-tuning are comparing it to the repositories listed below.
- ☆13Aug 23, 2024Updated last year
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated last year
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Apr 14, 2025Updated 10 months ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore☆12May 27, 2023Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated 10 months ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- Instruction Following Eval☆15Jan 16, 2025Updated last year
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Jan 4, 2024Updated 2 years ago
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25…☆23Dec 10, 2025Updated 2 months ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆19Apr 21, 2025Updated 9 months ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- A library for minimum Bayes risk (MBR) decoding☆51Nov 2, 2025Updated 3 months ago
- A lightweight, user-friendly data-plane for LLM training.☆38Sep 10, 2025Updated 5 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (supports LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆27Aug 8, 2025Updated 6 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset☆28Jan 19, 2025Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Mar 3, 2025Updated 11 months ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 9 months ago
- ☆34Nov 15, 2023Updated 2 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆36Aug 29, 2025Updated 5 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆72Mar 6, 2024Updated last year
- ☆35Jun 15, 2023Updated 2 years ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 4 months ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- This repository contains code for the *SEM 2023 paper “Generative Data Augmentation for Aspect Sentiment Quad Prediction”.☆11May 30, 2023Updated 2 years ago
- ☆12Jan 15, 2015Updated 11 years ago
- ☆11May 18, 2022Updated 3 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ☆10Oct 2, 2024Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated 3 weeks ago
- ☆39Jan 23, 2024Updated 2 years ago
- Evaluation Pipeline for medical tasks.☆12Updated this week
- Repository containing the open source code of works published at the FBK MT unit.☆59Jan 16, 2026Updated 3 weeks ago