Multilingual Large Language Models Evaluation Benchmark
☆132 · Aug 21, 2024 · Updated last year
Alternatives and similar repositories for mlmm-evaluation
Users that are interested in mlmm-evaluation are comparing it to the libraries listed below
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆96 · Aug 18, 2023 · Updated 2 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023) ☆19 · Dec 8, 2023 · Updated 2 years ago
- Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation ☆15 · Jul 19, 2021 · Updated 4 years ago
- German Alpaca Dataset (Cleaned + Translated) ☆26 · Apr 6, 2023 · Updated 2 years ago
- ☆92 · Dec 23, 2024 · Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆96 · Oct 30, 2024 · Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆72 · Mar 6, 2024 · Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆95 · Aug 15, 2023 · Updated 2 years ago
- 🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024 ☆20 · Apr 6, 2025 · Updated 10 months ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco… ☆36 · Aug 29, 2025 · Updated 6 months ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th… ☆22 · Mar 20, 2024 · Updated last year
- ☆23 · Jan 25, 2023 · Updated 3 years ago
- Common tools for data processing ☆22 · Dec 8, 2025 · Updated 2 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models ☆17 · Jul 17, 2024 · Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆24 · Aug 25, 2025 · Updated 6 months ago
- ☆47 · Jun 20, 2024 · Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs" ☆10 · Dec 13, 2024 · Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Jul 14, 2025 · Updated 7 months ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020. ☆23 · Aug 20, 2021 · Updated 4 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings ☆23 · Feb 24, 2026 · Updated last week
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he… ☆27 · Aug 8, 2025 · Updated 6 months ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆26 · Jun 3, 2025 · Updated 9 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 5 months ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency … ☆10 · Jul 18, 2023 · Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 · Oct 11, 2024 · Updated last year
- ☆10 · Oct 17, 2022 · Updated 3 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings ☆10 · Dec 17, 2025 · Updated 2 months ago
- ☆11 · Dec 8, 2022 · Updated 3 years ago
- benchmarks for evaluating MT models ☆11 · Jun 26, 2024 · Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models" ☆18 · Oct 26, 2024 · Updated last year
- ☆266 · Aug 1, 2025 · Updated 7 months ago
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet ☆25 · Aug 29, 2018 · Updated 7 years ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models ☆33 · Jun 9, 2024 · Updated last year
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation ☆26 · Sep 30, 2022 · Updated 3 years ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation ☆14 · Aug 19, 2025 · Updated 6 months ago
- Evaluation results for Machine Translation within the BigScience project ☆11 · May 15, 2023 · Updated 2 years ago
- Efficient Low-Memory Aligner ☆146 · Jan 15, 2025 · Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning ☆26 · Mar 3, 2025 · Updated last year
- NMT with ssp ☆11 · Oct 28, 2021 · Updated 4 years ago