We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
☆32Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACMMM2025 Oral 🌟] Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆59Aug 25, 2025Updated 8 months ago
- TupleInfoNCE ICCV21☆17Jul 22, 2022Updated 3 years ago
- ☆14Jun 29, 2024Updated last year
- ☆24Sep 29, 2025Updated 7 months ago
- [Pattern Recognition 2025 🌟]Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation☆10Jun 12, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆204Aug 16, 2024Updated last year
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- [IEEE TMI 2025] MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention☆18Dec 15, 2025Updated 4 months ago
- 2018 Master's Project: Program to do online 3D reconstruction with pose correction of UAV. High localization and object dimensional accur…☆13Oct 20, 2018Updated 7 years ago
- ☆15May 5, 2025Updated 11 months ago
- Source Code for the JAIR Paper "Does CLIP Know my Face?" (Demo: https://huggingface.co/spaces/AIML-TUDA/does-clip-know-my-face)☆16Jul 9, 2024Updated last year
- Superpixel-enhanced Deep Neural Forest for Remote Sensing Image Semantic Segmentation☆15Oct 14, 2020Updated 5 years ago
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)☆26Apr 10, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Reflection Removal Using a Dual-Pixel Sensor, CVPR 2019☆17Jun 14, 2019Updated 6 years ago
- ☆23Sep 24, 2024Updated last year
- 实现机器学习实战以及关于周志华西瓜书中的一些扩展算法等☆10Oct 9, 2018Updated 7 years ago
- ☆14Jun 20, 2023Updated 2 years ago
- Text-to-face implementation using AttnGan architecture.☆17Feb 27, 2022Updated 4 years ago
- A tool to render 3D gaussian splatting(3DGS) .ply files to an image in real time by given a camera pose. Use python and CUDA.☆25Apr 24, 2025Updated last year
- ☆17Sep 27, 2023Updated 2 years ago
- Original OpenShoe Matlab algorithm rewritten in C to be used in real-time application☆16Jul 7, 2021Updated 4 years ago
- multimodal anomaly detection☆14Jan 17, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Satellite Image Based Cross-view Localization for Autonomous Vehicle, ICRA2023☆16Oct 29, 2024Updated last year
- Bird’s-eye view map from monocular cameras using BEVFormer + HOP methods.☆16Jan 17, 2024Updated 2 years ago
- Repository for the "AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation" paper☆23Oct 25, 2025Updated 6 months ago
- Model for Monocular Depth Estimation and Image Segmentation☆14Jul 31, 2021Updated 4 years ago
- ☆18Dec 9, 2021Updated 4 years ago
- ☆13May 30, 2025Updated 11 months ago
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps☆22Feb 5, 2024Updated 2 years ago
- Multimodal Supervised Variational Autoencoder☆19Nov 3, 2020Updated 5 years ago
- ☆22Apr 4, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official PyTorch implementation of our paper (Simple and Efficient: A Semisupervised Learning Framework for Remote Sensing Image Sema…☆22May 9, 2023Updated 2 years ago
- Implementation of View-volume network for semantic scene completion from a single depth image☆14Nov 15, 2019Updated 6 years ago
- Practical Depth Estimation with Image Segmentation and Serial U-Nets☆16May 25, 2020Updated 5 years ago
- Multi-View Image Fusion (uav * sat)☆17Sep 15, 2023Updated 2 years ago
- (IJCV 2025) Official Pytorch implementation of "C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contr…☆38Apr 15, 2025Updated last year
- Framework, which loads lidar pointclouds and converts them into a Bird's Eye View RGB image☆16Mar 30, 2020Updated 6 years ago
- This is a PyTorch implementation of a Bayesian Convolutional Neural Network (BCNN) for Semantic Scene Completion on the SUNCG dataset. Gi…☆15Mar 30, 2023Updated 3 years ago