We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates the proposed fusion strategy to perform multimodal material and semantic segmentation tasks.
β33Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for MMSFormer
Users that are interested in MMSFormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACMMM2025 Oral π] Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentationβ62Aug 25, 2025Updated 10 months ago
- β25Sep 29, 2025Updated 9 months ago
- Neural Transmitted Radiance Fieldsβ12Apr 11, 2024Updated 2 years ago
- β74Nov 29, 2023Updated 2 years ago
- Learning Representations that Support Robust Transfer of Predictorsβ20Nov 7, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)β209Aug 16, 2024Updated last year
- Official implementation of "Exploiting the Signal-Leak Bias in Diffusion Models" (WACV 2024)β20Apr 10, 2026Updated 2 months ago
- CVPR 2024 Official Repositoryβ13Mar 27, 2024Updated 2 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"β15Oct 12, 2023Updated 2 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023β16Jul 24, 2023Updated 2 years ago
- [IEEE TMI 2025] MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retentionβ19Dec 15, 2025Updated 6 months ago
- Official repository for paper "Deformable Cross-Attention Transformer for Weakly Aligned RGB-T Pedestrian Detection", IEEE transactions oβ¦β16May 28, 2025Updated last year
- 2018 Master's Project: Program to do online 3D reconstruction with pose correction of UAV. High localization and object dimensional accurβ¦β13Oct 20, 2018Updated 7 years ago
- Pytorch implementation of our WACV 2023 paper "Image-Consistent Detection of Road Anomalies As Unpredictable Patches"β12May 29, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)β26Apr 10, 2026Updated 2 months ago
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectorsβ28Jul 10, 2025Updated 11 months ago
- β16Aug 17, 2021Updated 4 years ago
- Reflection Removal Using a Dual-Pixel Sensor, CVPR 2019β17Jun 14, 2019Updated 7 years ago
- Platform-agnostic toolkit to spin up vLLM endpoints and submit high-throughput jobs (DataFrame or scripts) across Slurm and DGX Cloud Lepβ¦β23Updated this week
- β14Jun 20, 2023Updated 3 years ago
- Multi Task Learning for Semantic Segmentation, Instance Segmentation and Depth Estimationβ12Jun 12, 2022Updated 4 years ago
- β17Sep 27, 2023Updated 2 years ago
- Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"β17Aug 7, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- multimodal anomaly detectionβ14Jan 17, 2021Updated 5 years ago
- Satellite Image Based Cross-view Localization for Autonomous Vehicle, ICRA2023β17Oct 29, 2024Updated last year
- semantic point cloud, localization, map matching, semantic point cloud map, image segmentation, image detectionβ23Aug 26, 2023Updated 2 years ago
- Birdβs-eye view map from monocular cameras using BEVFormer + HOP methods.β16Jan 17, 2024Updated 2 years ago
- Model for Monocular Depth Estimation and Image Segmentationβ14Jul 31, 2021Updated 4 years ago
- β13May 30, 2025Updated last year
- Text indexing related functions in Go, including tokenizer, word marking, and snippet selecting, etc.β26Feb 14, 2016Updated 10 years ago
- Multimodal Supervised Variational Autoencoderβ19Nov 3, 2020Updated 5 years ago
- The official PyTorch implementation of our paper (Simple and Efficient: A Semisupervised Learning Framework for Remote Sensing Image Semaβ¦β22May 9, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modalityβ33Nov 25, 2024Updated last year
- Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IANβ19Oct 3, 2024Updated last year
- Practical Depth Estimation with Image Segmentation and Serial U-Netsβ16May 25, 2020Updated 6 years ago
- Multi-View Image Fusion (uav * sat)β18Sep 15, 2023Updated 2 years ago
- (IJCV 2025) Official Pytorch implementation of "C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contrβ¦β42Apr 15, 2025Updated last year
- Framework, which loads lidar pointclouds and converts them into a Bird's Eye View RGB imageβ16Mar 30, 2020Updated 6 years ago
- This is a PyTorch implementation of a Bayesian Convolutional Neural Network (BCNN) for Semantic Scene Completion on the SUNCG dataset. Giβ¦β15Mar 30, 2023Updated 3 years ago