ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark
β17May 25, 2025Updated 10 months ago
Alternatives and similar repositories for ARB
Users that are interested in ARB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ23Jan 26, 2026Updated 2 months ago
- [ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifactsβ19May 22, 2025Updated 10 months ago
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understandingβ¦β54Mar 13, 2025Updated last year
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 9 months ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Oct 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NAACL 2025 π₯] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.β37Apr 17, 2025Updated 11 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ29Oct 20, 2025Updated 5 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Claβ¦β47Sep 28, 2023Updated 2 years ago
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"β¦β52Nov 14, 2023Updated 2 years ago
- A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)β22Dec 17, 2025Updated 3 months ago
- Composed Video Retrievalβ63May 2, 2024Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabiβ¦β79Sep 24, 2024Updated last year
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.β19Sep 28, 2023Updated 2 years ago
- [CVPR 2025 π₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videosβ99Apr 14, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [CVPR 2025 π₯] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses theβ¦β46May 26, 2025Updated 10 months ago
- A new multi-task learning framework using Vision Transformersβ11Jun 19, 2024Updated last year
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challengesβ30Sep 24, 2023Updated 2 years ago
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]β33Oct 27, 2024Updated last year
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)β20Aug 24, 2023Updated 2 years ago
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Modelsβ15Nov 1, 2024Updated last year
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite foβ¦β50Aug 23, 2024Updated last year
- Kaggle Competition Dstl Satellite Imagery Feature Detectionβ10Apr 1, 2017Updated 8 years ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.β23Aug 19, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accepβ¦β56Oct 22, 2024Updated last year
- [IEEE TMI 2025] MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retentionβ17Dec 15, 2025Updated 3 months ago
- [MICCAI 2024 π₯] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptiβ¦β27Aug 5, 2024Updated last year
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasksβ61Feb 20, 2026Updated last month
- [CVPR -2025] GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Modelβ131Mar 22, 2025Updated last year
- Interview questions asked in Data Science/ Machine Learning interviewsβ19Jan 15, 2020Updated 6 years ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of β¦β68Dec 3, 2023Updated 2 years ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" β¦β34Jan 8, 2023Updated 3 years ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"β26Jun 8, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)β11Dec 30, 2022Updated 3 years ago
- β18Sep 18, 2025Updated 6 months ago
- Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identiβ¦β13Feb 28, 2024Updated 2 years ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memoryβ61Feb 28, 2025Updated last year
- Source code for MICCAI 2022 paper entitled: 'Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification'β36Jan 13, 2023Updated 3 years ago
- Reinforcement Training of Robotβ11Dec 1, 2019Updated 6 years ago
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".β25Jul 10, 2023Updated 2 years ago