JiaDingCN / GeminiFusion
☆92Aug 18, 2024Updated last year
Alternatives and similar repositories for GeminiFusion
Users that are interested in GeminiFusion are comparing it to the libraries listed below
- ☆27Jul 8, 2025Updated 7 months ago
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆200Aug 16, 2024Updated last year
- [RA-L 2025] CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes☆24Jan 26, 2026Updated 3 weeks ago
- Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆51Aug 25, 2025Updated 5 months ago
- This repo includes code for the paper "DynStatF: An Efficient Feature Fusion Strategy for LiDAR 3D Object Detection"☆24Dec 20, 2023Updated 2 years ago
- [CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeurIPS 2025] OmniSegmentor☆446Nov 11, 2025Updated 3 months ago
- ☆412Sep 2, 2024Updated last year
- [CVPR 2022] Code release for "Multimodal Token Fusion for Vision Transformers"☆183Jul 21, 2022Updated 3 years ago
- [IROS 2025] Official code of "HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation"☆22Jul 25, 2025Updated 6 months ago
- [WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation☆272Sep 10, 2025Updated 5 months ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆37Nov 4, 2025Updated 3 months ago
- RGBD Pretraining code used in DFormer [ICLR 2024]☆20Jul 8, 2025Updated 7 months ago
- Official code for MMANet: Margin-aware Distillation and Modality-aware Regularization for Incomplete Multimodal Learning☆91May 26, 2024Updated last year
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆63Mar 16, 2024Updated last year
- The code for the DHViT☆12Mar 6, 2022Updated 3 years ago
- Use python3 to convert depth image into hha image☆195Apr 12, 2024Updated last year
- Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors, CVPR 2024☆22Oct 12, 2024Updated last year
- EMSANet: Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments☆71Jan 1, 2026Updated last month
- ☆22Mar 18, 2025Updated 10 months ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 6 months ago
- ☆17Jul 25, 2021Updated 4 years ago
- Multi-Sensor Place Recognition with Visual and Text Semantics☆21May 27, 2025Updated 8 months ago
- Official Pytorch implementation of "C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contrastive Learn…☆32Apr 15, 2025Updated 10 months ago
- ☆20Mar 12, 2024Updated last year
- CE-VAE Underwater Image Enhancement☆24May 19, 2025Updated 8 months ago
- ☆58Jun 3, 2024Updated last year
- Code for: Rethinking Cross-Attention for Infrared and Visible Image Fusion☆50May 10, 2024Updated last year
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆41Oct 19, 2025Updated 3 months ago
- Calibrated and Complementary Transformer for RGB-Infrared Object Detection☆100May 9, 2024Updated last year
- ☆22Dec 9, 2022Updated 3 years ago
- ☆25Jul 24, 2024Updated last year
- [CVPR 2024] SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking☆57Jun 30, 2024Updated last year
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆112Jun 26, 2024Updated last year
- We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new mode…☆31Apr 18, 2024Updated last year
- [ICIP 2022] Code for "Multi-Scale RAFT: Combining Hierarchical Concepts for Learning-Based Optical Flow Estimation"☆23Jan 22, 2024Updated 2 years ago
- Visible-Thermal UAV Tracking: A Large-Scale Benchmark (CVPR2022)☆109Feb 7, 2025Updated last year
- WaterMamba: Visual State Space Model for UnderWater Image Enhancement☆29Nov 20, 2024Updated last year
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…☆29Apr 20, 2025Updated 9 months ago
- Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection☆32Sep 26, 2024Updated last year