mbzuai-nlp / ArabicMMLULinks

☆27

Alternatives and similar repositories for ArabicMMLU

Users that are interested in ArabicMMLU are comparing it to the libraries listed below

Sorting:

nlp-uoregon / mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
☆127Updated 10 months ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆93Updated last year
nlp-uoregon / Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆97Updated last year
mbzuai-nlp / bactrian-x
A Multilingual Replicable Instruction-Following Model
☆94Updated 2 years ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆145Updated 8 months ago
kaistAI / LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆90Updated 8 months ago
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆103Updated last year
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆162Updated 3 weeks ago
gsarti / pecore
Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑
☆15Updated last year
allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆221Updated 8 months ago
jakespringer / echo-embeddings
☆151Updated last year
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers
☆131Updated last year
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 4 months ago
xlang-ai / BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆150Updated last month
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆78Updated last year
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆229Updated 8 months ago
ParticleMedia / RAGTruth
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆191Updated 7 months ago
qinyiwei / InfoBench
☆55Updated 10 months ago
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆74Updated last month
dadelani / sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆21Updated 5 months ago
kevinwu23 / StanfordClashEval
☆36Updated 6 months ago
AIR-Bench / AIR-Bench
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆148Updated 7 months ago
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆85Updated last year
RUCAIBox / Language-Specific-Neurons
☆75Updated 6 months ago
neulab / data-agora
[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆33Updated 7 months ago
wxjiao / ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…
☆176Updated 6 months ago
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains
☆149Updated last month
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆114Updated last year