mbzuai-oryx / ALM-BenchView external linksLinks
[CVPR 2025 π₯] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the next generation of LMMs on cultural inclusitivity.
β47May 26, 2025Updated 8 months ago
Alternatives and similar repositories for ALM-Bench
Users that are interested in ALM-Bench are comparing it to the libraries listed below
Sorting:
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 7 months ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detectionβ30Jun 21, 2023Updated 2 years ago
- β11Oct 29, 2024Updated last year
- A new multi-task learning framework using Vision Transformersβ11Jun 19, 2024Updated last year
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite foβ¦β50Aug 23, 2024Updated last year
- [ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifactsβ18May 22, 2025Updated 8 months ago
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmarkβ17May 25, 2025Updated 8 months ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Oct 11, 2024Updated last year
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.β17Aug 19, 2025Updated 5 months ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ22Jan 26, 2026Updated 2 weeks ago
- [MICCAI 2024 π₯] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptiβ¦β27Aug 5, 2024Updated last year
- A codeβ29Jan 23, 2025Updated last year
- [β CVPR 2025 Highlight β] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing froβ¦β29Apr 22, 2025Updated 9 months ago
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".β25Jul 10, 2023Updated 2 years ago
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)β11Aug 11, 2025Updated 6 months ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"β14Nov 1, 2024Updated last year
- β42Nov 9, 2023Updated 2 years ago
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.β19May 6, 2025Updated 9 months ago
- [ CVPR 2025 π₯] STING-BEE, the first domain-aware visual AI assistant for X-ray baggage security screening.β23Jun 27, 2025Updated 7 months ago
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accepβ¦β56Oct 22, 2024Updated last year
- β35Feb 5, 2024Updated 2 years ago
- Validating image classification benchmark results on ViTs and ResNets (v2)β13Nov 3, 2022Updated 3 years ago
- SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation (BMVC'23 -- Oral)β19Dec 14, 2023Updated 2 years ago
- A Comprehensive Benchmark for Robust Multi-image Understandingβ17Sep 4, 2024Updated last year
- β18Sep 23, 2024Updated last year
- β16Oct 21, 2024Updated last year
- Official implementation of the paper "FedSIS: Federated Split Learning with Intermediate Representation Sampling for Privacy-preserving Gβ¦β16Sep 5, 2023Updated 2 years ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing theirβ¦β20Jan 11, 2026Updated last month
- [CVPR'25]Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacksβ29Jun 12, 2025Updated 8 months ago
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabiβ¦β79Sep 24, 2024Updated last year
- [InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Clβ¦β38Dec 11, 2024Updated last year
- [CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignmentβ27Jun 11, 2025Updated 8 months ago
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentationβ41Sep 2, 2022Updated 3 years ago
- β47Nov 7, 2024Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)β25May 16, 2024Updated last year
- An Open-source Factuality Evaluation Demo for LLMsβ31Aug 10, 2025Updated 6 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoningβ24Sep 9, 2024Updated last year
- β54Jan 17, 2025Updated last year
- [ACL 2025 π₯] Rethinking Step-by-step Visual Reasoning in LLMsβ310May 21, 2025Updated 8 months ago