CSIPlab / MMSFormer
We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
☆31Updated last year
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below
- Official implementation of the CVPR 2024 paper "Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling"☆23Updated last year
- ☆22Updated 7 months ago
- ☆33Updated 11 months ago
- ICLR2024 When Semantic Segmentation Meets Frequency Aliasing☆46Updated last year
- Code for UAED and MuGE☆91Updated 9 months ago
- Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling☆194Updated last month
- ☆63Updated 2 years ago
- ☆92Updated last year
- Unofficial edge detection implementation using the Automatic Mask Generation (AMG) of the Segment Anything Model (SAM).☆76Updated 7 months ago
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆198Updated last year
- CVPR2024 Frequency-Adaptive Dilated Convolution☆38Updated last year
- ☆70Updated 3 years ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆130Updated 10 months ago
- ☆50Updated last year
- [IEEE TPAMI'25] Low-Resolution Self-Attention For Semantic Segmentation☆66Updated 7 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆110Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy☆100Updated last year
- Dataset & Code for ACM Multimedia 2023 paper. "SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispec…☆14Updated 9 months ago
- Code for PID: Physics-Informed Diffusion Model for Infrared Image Generation☆148Updated 4 months ago
- (ICCV'23) Learning to Upsample by Learning to Sample☆182Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆79Updated last year
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆82Updated 9 months ago
- Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆49Updated 5 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025]☆132Updated 10 months ago
- ☆210Updated last year
- Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook☆73Updated 2 years ago
- ☆68Updated 2 years ago
- ☆55Updated last month
- PyTorch implementation of PaCa-ViT (CVPR'23)☆35Updated 2 years ago
- ☆45Updated last year