Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
☆26 · Jun 3, 2025 · Updated 11 months ago
Alternatives and similar repositories for multilingual-instruction-tuning
Users that are interested in multilingual-instruction-tuning are comparing it to the libraries listed below.
- ☆13 · Aug 23, 2024 · Updated last year
- Library for experimenting with state-of-the-art evaluation metrics like UScore ☆12 · May 27, 2023 · Updated 2 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec… ☆10 · Apr 14, 2025 · Updated last year
- Curriculum training ☆22 · Jun 25, 2025 · Updated 10 months ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t… ☆18 · Feb 22, 2024 · Updated 2 years ago
- Instruction Following Eval ☆17 · Jan 16, 2025 · Updated last year
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 · Jan 19, 2024 · Updated 2 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ☆11 · Apr 6, 2025 · Updated last year
- Evaluation results for Machine Translation within the BigScience project ☆11 · May 15, 2023 · Updated 2 years ago
- ☆21 · Dec 5, 2022 · Updated 3 years ago
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar". ☆12 · Jan 4, 2024 · Updated 2 years ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning ☆26 · Mar 3, 2025 · Updated last year
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25… ☆24 · Dec 10, 2025 · Updated 4 months ago
- Word Sense Linking model is designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory. ☆13 · Aug 23, 2024 · Updated last year
- Tutorials on how to Query Data ☆13 · Jan 7, 2023 · Updated 3 years ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆18 · Apr 22, 2026 · Updated 2 weeks ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation" ☆13 · Nov 3, 2022 · Updated 3 years ago
- ☆15 · Dec 26, 2024 · Updated last year
- A library for minimum Bayes risk (MBR) decoding ☆52 · Nov 2, 2025 · Updated 6 months ago
- Python Web Scraper ☆13 · Mar 7, 2018 · Updated 8 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs. ☆18 · Apr 21, 2025 · Updated last year
- A package containing utils for the PyTorch version of the Tapas algorithm. ☆11 · Apr 29, 2021 · Updated 5 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models" ☆91 · Jan 9, 2026 · Updated 4 months ago
- [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages ☆106 · Apr 14, 2026 · Updated 3 weeks ago
- Discovery of Rhyme Schemes in Poetry ☆17 · Nov 22, 2011 · Updated 14 years ago
- Japanese Massive Multitask Language Understanding Benchmark ☆39 · Oct 7, 2025 · Updated 7 months ago
- A LangGraph-powered agent that finds and analyzes similar companies, leveraging Qdrant for data storage, Exa for research, and Gmail for … ☆20 · Feb 26, 2026 · Updated 2 months ago
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset ☆28 · Jan 19, 2025 · Updated last year
- Evaluating Reward Models in Multilingual Settings (ACL Main '25) ☆42 · May 16, 2025 · Updated 11 months ago
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024) ☆11 · Feb 22, 2024 · Updated 2 years ago
- ☆40 · Jan 23, 2024 · Updated 2 years ago
- ☆21 · Feb 13, 2023 · Updated 3 years ago
- Interview-based evaluation of LLMs ☆27 · Jan 8, 2025 · Updated last year
- Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only ☆13 · Sep 15, 2020 · Updated 5 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings ☆11 · Mar 14, 2022 · Updated 4 years ago
- Crosslingual Reasoning through Test-Time Scaling ☆19 · May 13, 2025 · Updated 11 months ago
- A lightweight, user-friendly data-plane for LLM training. ☆39 · Sep 10, 2025 · Updated 7 months ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco… ☆38 · Aug 29, 2025 · Updated 8 months ago