CSIPlab / MMSFormerLinks
We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
☆30Updated last year
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below
Sorting:
- ☆33Updated 9 months ago
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆196Updated last year
- CVPR2024 Frequency-Adaptive Dilated Convolution☆37Updated last year
- Unofficial edge detection implementation using the Automatic Mask Generation (AMG) of the Segment Anything Model (SAM).☆76Updated 5 months ago
- Code for UAED and MuGE☆90Updated 8 months ago
- ☆66Updated 2 years ago
- Code for PID: Physics-Informed Diffusion Model for Infrared Image Generation☆141Updated 3 months ago
- ☆92Updated last year
- [IEEE TPAMI'25] Low-Resolution Self-Attention For Semantic Segmentation☆55Updated 6 months ago
- ICLR2024 When Sementic Segmentation Meets Frequency Aliasing☆45Updated last year
- Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook☆73Updated 2 years ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆79Updated 11 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆98Updated last year
- Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling☆161Updated last week
- Dataset & Code for ACM Multimedia 2023 paper. "SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispec…☆14Updated 8 months ago
- (ICCV'23) Learning to Upsample by Learning to Sample☆171Updated last year
- ☆69Updated 3 years ago
- ☆22Updated 5 months ago
- Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆33Updated 3 months ago
- ☆27Updated 5 months ago
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation☆163Updated 11 months ago
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆67Updated last year
- CVPR 2025 | Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond☆89Updated 3 weeks ago
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding☆59Updated 3 weeks ago
- [ICCV2025] Official Pytorch Implementation of TinyViM☆103Updated 5 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆108Updated last year
- ICCV2023 | Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation☆152Updated 2 years ago
- ☆208Updated last year
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆124Updated 9 months ago
- PyTorch implementation of PaCa-ViT (CVPR'23)☆34Updated 2 years ago