GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]
☆132Mar 22, 2025Updated 11 months ago
Alternatives and similar repositories for GroupMamba
Users that are interested in GroupMamba are comparing it to the libraries listed below
Sorting:
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆60Feb 28, 2025Updated last year
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos☆22Jan 26, 2026Updated last month
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 8 months ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]☆22Oct 27, 2024Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Sep 24, 2024Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models☆15Nov 1, 2024Updated last year
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark☆17May 25, 2025Updated 9 months ago
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".☆25Jul 10, 2023Updated 2 years ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆80Dec 25, 2024Updated last year
- ☆66Sep 11, 2024Updated last year
- ☆26Oct 15, 2024Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90May 30, 2025Updated 9 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆196Mar 4, 2025Updated 11 months ago
- [CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"☆118Apr 22, 2025Updated 10 months ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …☆34Jan 8, 2023Updated 3 years ago
- ☆55Apr 28, 2025Updated 10 months ago
- Code Implementation of EfficientVMamba☆243Apr 16, 2024Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆274May 6, 2024Updated last year
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)☆22Dec 17, 2025Updated 2 months ago
- VMamba: Visual State Space Models,code is based on mamba☆3,046Mar 7, 2025Updated 11 months ago
- Composed Video Retrieval☆62May 2, 2024Updated last year
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"…☆52Nov 14, 2023Updated 2 years ago
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆135Aug 6, 2025Updated 6 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…☆47Sep 28, 2023Updated 2 years ago
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…☆50Aug 23, 2024Updated last year
- [ICCV2025] Introduce Mamba2 to Vision.☆185Oct 29, 2025Updated 4 months ago
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆24Oct 4, 2025Updated 4 months ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"☆26Jun 8, 2025Updated 8 months ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆96Apr 14, 2025Updated 10 months ago
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆2,034Feb 9, 2026Updated 3 weeks ago
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]☆32Oct 27, 2024Updated last year
- Code for the paper: "FusionMamba: Efficient Image Fusion with State Space Model", TGRS, 2024.☆135Jan 31, 2026Updated last month
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆17Aug 19, 2025Updated 6 months ago
- CGGLNet: Semantic Segmentation Network for Remote Sensing Images Based on Category-Guided Global-Local Feature Interaction☆21Sep 2, 2025Updated 6 months ago
- ☆21Dec 14, 2025Updated 2 months ago
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.☆36Apr 17, 2025Updated 10 months ago