[CVPR 2025 π₯] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the next generation of LMMs on cultural inclusitivity.
β46May 26, 2025Updated 9 months ago
Alternatives and similar repositories for ALM-Bench
Users that are interested in ALM-Bench are comparing it to the libraries listed below
Sorting:
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Oct 27, 2024Updated last year
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.β19Sep 28, 2023Updated 2 years ago
- β11Oct 29, 2024Updated last year
- A new multi-task learning framework using Vision Transformersβ11Jun 19, 2024Updated last year
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite foβ¦β50Aug 23, 2024Updated last year
- [ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifactsβ18May 22, 2025Updated 9 months ago
- [BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"β28Dec 18, 2025Updated 2 months ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)β20Aug 24, 2023Updated 2 years ago
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Modelsβ15Nov 1, 2024Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Oct 11, 2024Updated last year
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]β32Oct 27, 2024Updated last year
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"β¦β52Nov 14, 2023Updated 2 years ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Claβ¦β47Sep 28, 2023Updated 2 years ago
- [NAACL 2025 π₯] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.β36Apr 17, 2025Updated 10 months ago
- [MICCAI 2024 π₯] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptiβ¦β27Aug 5, 2024Updated last year
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.β23Aug 19, 2025Updated 6 months ago
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes πππβ37Jan 21, 2025Updated last year
- [CVPR 2025 π₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videosβ97Apr 14, 2025Updated 10 months ago
- Bilingual Medical Mixture of Experts LLMβ32Nov 23, 2024Updated last year
- [β CVPR 2025 Highlight β] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing froβ¦β29Apr 22, 2025Updated 10 months ago
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".β25Jul 10, 2023Updated 2 years ago
- β10Feb 20, 2024Updated 2 years ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"β14Nov 1, 2024Updated last year
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)β11Aug 11, 2025Updated 6 months ago
- β42Nov 9, 2023Updated 2 years ago
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representationβ17Feb 8, 2024Updated 2 years ago
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.β19May 6, 2025Updated 10 months ago
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accepβ¦β56Oct 22, 2024Updated last year
- β21Jul 25, 2025Updated 7 months ago
- [ CVPR 2025 π₯] STING-BEE, the first domain-aware visual AI assistant for X-ray baggage security screening.β24Jun 27, 2025Updated 8 months ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Studyβ16Nov 22, 2024Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)β13Mar 8, 2024Updated last year
- β35Feb 5, 2024Updated 2 years ago
- SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation (BMVC'23 -- Oral)β19Dec 14, 2023Updated 2 years ago
- Validating image classification benchmark results on ViTs and ResNets (v2)β13Nov 3, 2022Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.β16Aug 23, 2023Updated 2 years ago
- 3D Mitochondria Instance Segmentation with Spatio-Temporal Transformersβ14Apr 17, 2023Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)β16Jan 18, 2024Updated 2 years ago
- A Comprehensive Benchmark for Robust Multi-image Understandingβ19Sep 4, 2024Updated last year