CSIPlab / MMSFormerLinks
We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
☆18Updated last year
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below
Sorting:
- ICLR2024 When Sementic Segmentation Meets Frequency Aliasing☆44Updated last year
- ☆29Updated 4 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆72Updated 6 months ago
- ☆15Updated last week
- ☆45Updated 5 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆56Updated 2 months ago
- ☆21Updated 5 months ago
- ☆84Updated 10 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆93Updated 9 months ago
- CVPR 2025 | Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond☆66Updated last month
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆62Updated 7 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆92Updated 9 months ago
- ☆65Updated last year
- ICCV2023 | Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation☆131Updated last year
- Dataset & Code for ACM Multimedia 2023 paper. "SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispec…☆13Updated 2 months ago
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆180Updated 10 months ago
- ☆63Updated 11 months ago
- ☆17Updated 2 years ago
- ☆76Updated last year
- CVPR 2024: AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation☆82Updated 2 weeks ago
- CVPR2024 Frequency-Adaptive Dilated Convolution☆29Updated last year
- ☆66Updated 2 years ago
- ☆75Updated 4 months ago
- Offical code for Multimodal Image Fusion based on Hybrid CNN-Transformer and Non-local Cross-modal Attention☆17Updated 2 years ago
- ☆52Updated last year
- ☆21Updated 3 months ago
- ☆59Updated 2 years ago
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆89Updated 2 weeks ago
- Code for the paper: "U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion", ACM MM 2023☆19Updated last year
- [NeurIPS 2024 Spotlight] Official repository of SynRS3D☆60Updated last month