CSIPlab/MMSFormer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CSIPlab/MMSFormer)

CSIPlab / MMSFormer

We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.

☆33

Alternatives and similar repositories for MMSFormer

Users that are interested in MMSFormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LiBingyu01 / StitchFusion
View on GitHub
[ACMMM2025 Oral 🌟] Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation
☆62Aug 25, 2025Updated 11 months ago
kyotovision-public / multimodal-material-segmentation
View on GitHub
☆74Nov 29, 2023Updated 2 years ago
Mhaiyang / CVPR2022_PGSNet
View on GitHub
☆34Mar 23, 2024Updated 2 years ago
InSAI-Lab / DELIVER
View on GitHub
Repository of DELIVER dataset and CMNeXt models (CVPR 2023)
☆211Aug 16, 2024Updated last year
Multi-Modality-Tracking / CKD-ACMMM2024
View on GitHub
Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation ACMMM2024
☆23Oct 16, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fangyuanmao / UNIV
View on GitHub
☆17May 22, 2025Updated last year
aurooj / MMFT-BERT
View on GitHub
☆14Jun 29, 2024Updated 2 years ago
WenliangDu / MambaDiffusion
View on GitHub
☆19Nov 11, 2024Updated last year
FreeButUselessSoul / TNeRF
View on GitHub
Neural Transmitted Radiance Fields
☆12Apr 11, 2024Updated 2 years ago
LiBingyu01 / U3M
View on GitHub
[Pattern Recognition 2025 🌟]Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
☆10Jun 12, 2024Updated 2 years ago
CVEO / RoofMapNet
View on GitHub
☆22Jun 23, 2025Updated last year
yue-zhongqi / tif
View on GitHub
CVPR 2024 Official Repository
☆13Mar 27, 2024Updated 2 years ago
VCIP-RGBD / RGBD-Pretrain
View on GitHub
RGBD Pretraining code used in DFormer [ICLR 2024]
☆21Jul 8, 2025Updated last year
navv37 / Dual-pol-powers
View on GitHub
☆12Jan 31, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ShaohuaDong2021 / DPLNet
View on GitHub
☆29Jul 8, 2025Updated last year
hayatkhan8660-maker / Fire_Seg_Dataset
View on GitHub
Official repository of "Efficient Fire Segmentation for Internet-of-Things-Assisted Intelligent Transportation Systems" [IEEE TITS 2022]
☆15Dec 22, 2024Updated last year
Retinal-Research / OTRE
View on GitHub
Code for the paper "OTRE: Where Optimal Transport Guided Unpaired Image-to-Image Translation Meets Regularization by Enhancing"
☆11Aug 2, 2025Updated 11 months ago
Claud1234 / CLFT
View on GitHub
This is the repository for FCN and Transformer based object segmentation that relies on the fusion of camera and LiDAR data.
☆38Feb 6, 2026Updated 5 months ago
yaoweilee / PMF
View on GitHub
Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023
☆16Jul 24, 2023Updated 3 years ago
MathLee / LASNet
View on GitHub
[TCSVT2023] [LASNet] RGB-T Semantic Segmentation with Location, Activation, and Sharpening
☆32Jan 13, 2026Updated 6 months ago
61s61min / MS2Fusion
View on GitHub
☆17Jun 3, 2026Updated last month
seqml / ConTSG-Bench
View on GitHub
Official code for "ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation" （ICML 2026）
☆17May 2, 2026Updated 2 months ago
52CV / WACV-2025-Papers
View on GitHub
☆41Jun 30, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
CalvinYang0 / CRNet
View on GitHub
☆87Jun 27, 2024Updated 2 years ago
vojirt / DaCUP
View on GitHub
Pytorch implementation of our WACV 2023 paper "Image-Consistent Detection of Road Anomalies As Unpredictable Patches"
☆12May 29, 2024Updated 2 years ago
D0miH / does-clip-know-my-face
View on GitHub
Source Code for the JAIR Paper "Does CLIP Know my Face?" (Demo: https://huggingface.co/spaces/AIML-TUDA/does-clip-know-my-face)
☆15Jul 9, 2024Updated 2 years ago
zhoujiahuan1991 / MM2024-InsVP
View on GitHub
☆15May 5, 2025Updated last year
jhshim1995 / FeedFormer
View on GitHub
☆21Aug 29, 2022Updated 3 years ago
dongjunhwang / ConOVS
View on GitHub
Official Implementation of "OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation" (NeurIPS 2025).
☆16Feb 27, 2026Updated 4 months ago
Shaoli-Huang / SPS
View on GitHub
☆16Aug 17, 2021Updated 4 years ago
mi18 / SNDF
View on GitHub
Superpixel-enhanced Deep Neural Forest for Remote Sensing Image Semantic Segmentation
☆15Oct 14, 2020Updated 5 years ago
Zhiyuan-Li-John / MuCR
View on GitHub
MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities
☆20May 27, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
abhijithpunnappurath / dprr
View on GitHub
Reflection Removal Using a Dual-Pixel Sensor, CVPR 2019
☆17Jun 14, 2019Updated 7 years ago
miquel-espinosa / COP-GEN
View on GitHub
[preprint] 🌍 COP-GEN: Latent Diffusion Transformer for Copernicus Earth Observation Data
☆18Apr 28, 2026Updated 2 months ago
Chenfei-Liao / MemorySAM
View on GitHub
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation
☆45Nov 4, 2025Updated 8 months ago
aerorobotics / caltech-aerial-rgbt-dataset
View on GitHub
☆90Jan 10, 2025Updated last year
ariyanzri / MegaStitch
View on GitHub
☆14Jun 20, 2023Updated 3 years ago
cdbharath / multitask-seg-depth
View on GitHub
Multi Task Learning for Semantic Segmentation, Instance Segmentation and Depth Estimation
☆12Jun 12, 2022Updated 4 years ago
riteshsonavane / Text2Face
View on GitHub
Text-to-face implementation using AttnGan architecture.
☆17Feb 27, 2022Updated 4 years ago