We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
☆31Apr 18, 2024Updated last year
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below
Sorting:
- Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆50Aug 25, 2025Updated 6 months ago
- ☆16Aug 17, 2021Updated 4 years ago
- TupleInfoNCE ICCV21☆17Jul 22, 2022Updated 3 years ago
- ☆68Nov 29, 2023Updated 2 years ago
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆13Aug 1, 2025Updated 7 months ago
- [CVPR2023] The official repository for paper "Learning Partial Correlation based Deep Visual Representation for Image Classification" To …☆10Nov 21, 2023Updated 2 years ago
- PyTorch implementation of Multi-Perspective Data Augmentation for Few-shot Object Detection☆22Apr 15, 2025Updated 10 months ago
- This project focuses on developing a machine learning model to classify various electrical fault types in a transmission line. The model …☆15Apr 9, 2024Updated last year
- [WACV 2025] "Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression"☆19Oct 14, 2025Updated 4 months ago
- Multi Task Learning for Semantic Segmentation, Instance Segmentation and Depth Estimation☆12Jun 12, 2022Updated 3 years ago
- This repo proves that sythtic dataset along with real world dataset can boost the performance of models for Pedestrian Intention Predicti…☆13Mar 24, 2025Updated 11 months ago
- Implementation of "Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes"☆12Oct 2, 2024Updated last year
- HSViT: Horizontally Scalable Vision Transformer☆13Nov 6, 2024Updated last year
- PyTorch implementation of quantization-aware matrix factorization (QMF) for data compression☆15Jul 14, 2025Updated 7 months ago
- Python script that can be used to generate latitude/longitude coordinates for GOES-16 full-disk extent.☆10Jan 26, 2022Updated 4 years ago
- ☆13May 30, 2025Updated 9 months ago
- Tutorial on deep generative models with experiments on MNIST☆11Nov 7, 2018Updated 7 years ago
- [ACM MM23] Pytorch implementation for paper: SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification☆12Jul 4, 2023Updated 2 years ago
- Biometric systems have become a major part of research due its application of identification. Code provides a multimodal biometric system…☆11Aug 16, 2016Updated 9 years ago
- ☆31Nov 20, 2025Updated 3 months ago
- Code for the DASFAA 2023 paper "Rainfall Spatial Interpolation with Graph Neural Networks".☆10Jul 17, 2023Updated 2 years ago
- [Ongoing Project] Codebase for network quantization study.☆12May 20, 2020Updated 5 years ago
- The official implementation of DRENet (Degraded Reconstruction Enhancement Network) for tiny ship detection in remote sensing Images☆53Jul 15, 2023Updated 2 years ago
- A GAN architecture conditioned on Action Units (AU) annotations generating facial expressions in a continuous domain.☆11Nov 22, 2022Updated 3 years ago
- The official implementation code of Paper "PointCVaR: Risk-optimized Outlier Removal for Robust 3D Point Cloud Classification" in AAAI 20…☆16Mar 27, 2024Updated last year
- Improving beat tracking algorithms with recurrent neural networks.☆11Jan 7, 2019Updated 7 years ago
- [CVPR 2024] No More Ambiguity in 360° Room Layout via Bi-Layout Estimation☆17Oct 9, 2024Updated last year
- An implementation of tone enhancement. May refer to "Two-scale Tone Management for Photographic Look", SIGGRAPH 2006.☆13Mar 30, 2017Updated 8 years ago
- ☆11Jun 21, 2022Updated 3 years ago
- [WACV 2021] Selective Spatio-Temporal Aggregation based Pose Refinement System: Towards understanding human activities in real-world vide…☆13Nov 4, 2021Updated 4 years ago
- CRNN_CTC_PyTorch☆10Oct 17, 2019Updated 6 years ago
- Some PyTorch code for the Kaggle Speech Recognition Challenge☆12Feb 7, 2019Updated 7 years ago
- University of Sheffield Research Software Engineering team's website☆16Feb 23, 2026Updated last week
- ☆11Jul 14, 2024Updated last year
- [RAL 2023] transformer + reinforcement learning for navigation + POMPD☆15Jul 19, 2023Updated 2 years ago
- Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates☆14Mar 11, 2025Updated 11 months ago
- A molecule generation benchmarking platform☆13Feb 22, 2018Updated 8 years ago
- ☆19Jan 19, 2026Updated last month
- Code execution runtime for the Cloud Cover competition☆11Jan 31, 2022Updated 4 years ago