mbzuai-oryx/ARB

ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark

☆17

Alternatives and similar repositories for ARB

Users who are interested in ARB are comparing it to the repositories listed below.

mbzuai-oryx / TimeTravel
[ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
☆20 · May 22, 2025 · Updated last year
mbzuai-oryx / VideoMathQA
VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos
☆24 · May 7, 2026 · Updated 2 months ago
mbzuai-oryx / AIN
AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…
☆55 · Mar 13, 2025 · Updated last year
HashmatShadab / HSAT
[MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
☆12 · Jun 17, 2025 · Updated last year
ShahinaKK / LWI-VMS
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]
☆22 · Oct 27, 2024 · Updated last year
mbzuai-oryx / DriveLMM-o1
Reasoning DriveLMM
☆15 · Mar 15, 2025 · Updated last year
mbzuai-oryx / Camel-Bench
[NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.
☆38 · Apr 17, 2025 · Updated last year
amandpkr / XM-GAN
[MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…
☆47 · Sep 28, 2023 · Updated 2 years ago
HashmatShadab / Robust-LLaVA
[ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models
☆29 · Oct 20, 2025 · Updated 9 months ago
asif-hanif / vafa
[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"…
☆52 · Nov 14, 2023 · Updated 2 years ago
OmkarThawakar / composed-video-retrieval
Composed Video Retrieval
☆62 · May 2, 2024 · Updated 2 years ago
techmn / cdchat
A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)
☆22 · Dec 17, 2025 · Updated 7 months ago
mbzuai-oryx / ClimateGPT
[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…
☆79 · Sep 24, 2024 · Updated last year
amandpkr / GMNR
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18 · Sep 28, 2023 · Updated 2 years ago
mbzuai-oryx / ALM-Bench
[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…
☆47 · May 26, 2025 · Updated last year
mbzuai-oryx / Video-CoM
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22 · Jun 17, 2026 · Updated last month
hananshafi / MTL-ViT
A new multi-task learning framework using Vision Transformers
☆11 · Jun 19, 2024 · Updated 2 years ago
htqin / GoogleBard-VisUnderstand
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges
☆30 · Sep 24, 2023 · Updated 2 years ago
ShahinaKK / LG_SDG
Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]
☆33 · Oct 27, 2024 · Updated last year
mbzuai-oryx / VideoGLaMM
[CVPR 2025 🔥] A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
☆104 · Apr 14, 2025 · Updated last year
HashmatShadab / Robustness-of-Volumetric-Medical-Segmentation-Models
[BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
☆15 · Nov 1, 2024 · Updated last year
OmkarThawakar / Self-Learning-Robot
Reinforcement Training of Robot
☆11 · Dec 1, 2019 · Updated 6 years ago
mbzuai-oryx / CVRR-Evaluation-Suite
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50 · Aug 23, 2024 · Updated last year
Hasindri / HLSS
[MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descripti…
☆27 · Aug 5, 2024 · Updated last year
asif-hanif / baple
[MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accep…
☆56 · Oct 22, 2024 · Updated last year
PiercingDan / kaggle-dstl
Kaggle Competition Dstl Satellite Imagery Feature Detection
☆10 · Apr 1, 2017 · Updated 9 years ago
mbzuai-oryx / EvoLMM
Self Evolving Large Multimodal Models with Continuous Rewards
☆25 · Jun 9, 2026 · Updated last month
Amshaker / GroupMamba
[CVPR 2025] GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
☆142 · Mar 22, 2025 · Updated last year
Muzammal-Naseer / DCViT-AT
Official repository for "Boosting Adversarial Transferability using Dynamic Cues" (ICLR 2023)
☆20 · Aug 24, 2023 · Updated 2 years ago
aminebdj / 3D-OWIS
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …
☆68 · Dec 3, 2023 · Updated 2 years ago
HashmatShadab / APR
(BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …
☆35 · Jan 8, 2023 · Updated 3 years ago
mbzuai-oryx / KITAB-Bench
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
☆76 · May 24, 2025 · Updated last year
Hanzy1996 / OpenSeg-R
OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
☆29 · May 24, 2025 · Updated last year
HashmatShadab / MambaRobustness
[CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"
☆26 · Jun 8, 2025 · Updated last year
sen-mao / FasterVAR
[ICML2026] Official Implementations "FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models"
☆27 · Jul 9, 2026 · Updated 2 weeks ago
Amshaker / MAVOS
[WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory
☆61 · Feb 28, 2025 · Updated last year
mustansarfiaz / PS-ARM
Abstract. Person search is a challenging problem with various real-world applications, that aims at joint person detection and re-identi…
☆13 · Feb 28, 2024 · Updated 2 years ago
Muzammal-Naseer / SAT
Official repository for "Stylized Adversarial Training" (TPAMI 2022)
☆11 · Dec 30, 2022 · Updated 3 years ago
rohit901 / VANE-Bench
[NAACL'25] Contains code and documentation for our VANE-Bench paper.
☆24 · Aug 19, 2025 · Updated 11 months ago