☆29 · Sep 17, 2024 · Updated last year
Alternatives and similar repositories for ArabicMMLU
Users who are interested in ArabicMMLU are comparing it to the libraries listed below.
- [ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning" ☆10 · Aug 2, 2024 · Updated last year
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos. ☆33 · Apr 18, 2026 · Updated last month
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆41 · Jan 4, 2024 · Updated 2 years ago
- ☆128 · Mar 3, 2024 · Updated 2 years ago
- ☆14 · Feb 9, 2022 · Updated 4 years ago
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts ☆19 · May 22, 2025 · Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks. ☆26 · Dec 9, 2024 · Updated last year
- ☆12 · Jun 12, 2024 · Updated last year
- Multilingual and Multiculture Benchmark and LLM ☆40 · May 18, 2026 · Updated 3 weeks ago
- 2D Vector-Quantized Auto-Encoder for compression of Whole-Slide Images in Histopathology ☆16 · Jul 18, 2024 · Updated last year
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination. ☆22 · Jul 18, 2025 · Updated 10 months ago
- Arabic-language questions focused on Saudi culture, tested on a number of large language models (LLMs) ☆18 · Jan 22, 2025 · Updated last year
- ☆11 · May 24, 2024 · Updated 2 years ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages ☆48 · Aug 10, 2025 · Updated 9 months ago
- Bilingual Medical Mixture of Experts LLM ☆32 · Nov 23, 2024 · Updated last year
- A Multilingual Replicable Instruction-Following Model ☆97 · Jun 11, 2023 · Updated 2 years ago
- ☆22 · Mar 19, 2024 · Updated 2 years ago
- An effort to benchmark Arabic legal reasoning in foundation models. ☆19 · May 21, 2025 · Updated last year
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and… ☆27 · Dec 26, 2025 · Updated 5 months ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis" ☆20 · Jun 12, 2025 · Updated 11 months ago
- Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Tra… ☆16 · Jul 14, 2023 · Updated 2 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. ☆46 · Apr 3, 2025 · Updated last year
- pymur is a Python interface to The Lemur Toolkit. ☆19 · Sep 17, 2018 · Updated 7 years ago
- Using self-play to augment multi-turn text-to-SQL datasets ☆11 · Oct 20, 2022 · Updated 3 years ago
- ☆18 · Apr 10, 2023 · Updated 3 years ago
- ☆16 · Sep 22, 2024 · Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 10 months ago
- Chinese processing ☆36 · Jan 29, 2014 · Updated 12 years ago
- ☆20 · Apr 26, 2026 · Updated last month
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries ☆66 · Feb 29, 2024 · Updated 2 years ago
- NLP command-line assistant powered by OpenAI ☆21 · Jan 27, 2024 · Updated 2 years ago
- ☆15 · Jun 1, 2026 · Updated last week
- ☆15 · Jul 3, 2025 · Updated 11 months ago
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts ☆65 · Dec 1, 2025 · Updated 6 months ago
- This is a port of Mistral-7B model in JAX ☆33 · Jul 1, 2024 · Updated last year
- Towards Systematic Measurement for Long Text Quality ☆38 · Sep 5, 2024 · Updated last year
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL. ☆29 · Mar 11, 2025 · Updated last year
- Analyzing deviation from orthogonality in RNNs ☆16 · Oct 30, 2017 · Updated 8 years ago
- Lynx Game Search, built with Lynx by ByteDance, is a comprehensive app designed to provide users with detailed information about various … ☆21 · May 20, 2025 · Updated last year