mbzuai-nlp / ArabicMMLULinks
☆27Updated 8 months ago
Alternatives and similar repositories for ArabicMMLU
Users that are interested in ArabicMMLU are comparing it to the libraries listed below
Sorting:
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆14Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆122Updated 9 months ago
- ☆13Updated 2 weeks ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆25Updated 3 months ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆25Updated 5 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆38Updated 8 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆96Updated last year
- Code for Zero-Shot Tokenizer Transfer☆128Updated 4 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆89Updated 7 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated 2 weeks ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆52Updated last month
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 6 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆34Updated last year
- ☆17Updated last year
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆53Updated 9 months ago
- ☆32Updated last year
- ☆53Updated 9 months ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆21Updated 4 months ago
- ☆42Updated last year
- ☆116Updated 6 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆57Updated 9 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆47Updated 6 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆157Updated 9 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆101Updated last year
- ☆124Updated last year
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆106Updated 3 months ago
- ☆75Updated 5 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆36Updated 10 months ago