☆29Sep 17, 2024Updated last year
Alternatives and similar repositories for ArabicMMLU
Users that are interested in ArabicMMLU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆33Apr 18, 2026Updated 2 months ago
- ☆128Mar 3, 2024Updated 2 years ago
- Arabic theme for blogging☆17Nov 16, 2023Updated 2 years ago
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts☆19May 22, 2025Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated last year
- Multilingual and Multiculture Benchmark and LLM☆41May 18, 2026Updated last month
- 2D Vector-Quantized Auto-Encoder for compression of Whole-Slide Images in Histopathology☆16Jul 18, 2024Updated last year
- ✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks☆18Aug 16, 2024Updated last year
- [ICML 2025] QT-DOG: QUANTIZATION-AWARE TRAINING FOR DOMAIN GENERALIZATION☆25Nov 30, 2025Updated 6 months ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆18Jan 22, 2025Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆49Aug 10, 2025Updated 10 months ago
- ☆13Jun 16, 2021Updated 5 years ago
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11May 9, 2022Updated 4 years ago
- An effort to benchmark Arabic legal reasoning in foundation models.☆19May 21, 2025Updated last year
- A Comprehensive Rare Disease Diagnostic Dataset with nearly 50,000 patients covering more than 4000 diseases☆45Mar 13, 2026Updated 3 months ago
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and…☆28Dec 26, 2025Updated 6 months ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆20Jun 12, 2025Updated last year
- Clone of the TikTok application☆10Sep 29, 2021Updated 4 years ago
- ☆12Jan 6, 2021Updated 5 years ago
- pymur is a Python interface to The Lemur Toolkit.☆19Sep 17, 2018Updated 7 years ago
- NLP Course (ICTS6361)☆11Jan 10, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated last year
- Paper Implementation of Self-Rewarding Language Models☆13Feb 1, 2024Updated 2 years ago
- ☆71Jul 2, 2025Updated 11 months ago
- ☆16Sep 22, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 11 months ago
- Chinese processing☆36Jan 29, 2014Updated 12 years ago
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accep…☆56Oct 22, 2024Updated last year
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- Repository containing the website for the EMNLP 2023 conference☆17Feb 12, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆34Jun 9, 2024Updated 2 years ago
- A sample agent demonstrating A2A + ADK + MCP working together.☆36Jun 4, 2026Updated 3 weeks ago
- docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Lear…☆11May 19, 2026Updated last month
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆46May 26, 2025Updated last year
- ☆15Jul 3, 2025Updated 11 months ago
- Matrix exponential in cuda for pytorch and tensorflow☆17Nov 26, 2018Updated 7 years ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆28Mar 11, 2025Updated last year