mbzuai-nlp / ArabicMMLULinks
☆28Updated last year
Alternatives and similar repositories for ArabicMMLU
Users that are interested in ArabicMMLU are comparing it to the libraries listed below
Sorting:
- Multilingual Large Language Models Evaluation Benchmark☆133Updated last year
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆94Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆71Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆224Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Updated last year
- Code for Zero-Shot Tokenizer Transfer☆142Updated 10 months ago
- ☆89Updated 11 months ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Updated 10 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆95Updated last year
- ☆189Updated 5 months ago
- ☆53Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 9 months ago
- GEMBA — GPT Estimation Metric Based Assessment☆134Updated last year
- Resources for cultural NLP research☆110Updated 2 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆215Updated last year
- Datasets for Instruction Tuning of Large Language Models☆259Updated 2 years ago
- ☆51Updated last year
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆176Updated 11 months ago
- The official evaluation suite and dynamic data release for MixEval.☆253Updated last year
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆123Updated last month
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆57Updated last year
- ☆57Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Updated 9 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆194Updated last year
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆128Updated last year
- contrastive decoding☆204Updated 3 years ago