☆93Aug 18, 2024Updated last year
Alternatives and similar repositories for GeminiFusion
Users that are interested in GeminiFusion are comparing it to the libraries listed below
Sorting:
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023)☆201Aug 16, 2024Updated last year
- [RA-L 2025] CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes☆25Jan 26, 2026Updated last month
- ☆417Sep 2, 2024Updated last year
- [ACMMM2025 Oral 🌟] Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation☆53Aug 25, 2025Updated 6 months ago
- [WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation☆279Sep 10, 2025Updated 6 months ago
- [CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeuriPS 2025] OmniSegmentor☆450Nov 11, 2025Updated 4 months ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆40Nov 4, 2025Updated 4 months ago
- Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation ACMMM2024☆22Oct 16, 2024Updated last year
- Use python3 to convert depth image into hha image☆196Apr 12, 2024Updated last year
- Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023)☆29Apr 1, 2024Updated last year
- EMSANet: Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments☆70Jan 1, 2026Updated 2 months ago
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆63Mar 16, 2024Updated 2 years ago
- Utility to convert the NYU Depth V2 dataset into point clouds for advanced 3D visualization and analysis.☆14Nov 27, 2024Updated last year
- (IJCV 2025) Official Pytorch implementation of "C2RF: Bridging Multi-modal Image Registration and Fusion via Commonality Mining and Contr…☆36Apr 15, 2025Updated 11 months ago
- [IROS 2025] Official code of ”HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation“☆23Jul 25, 2025Updated 7 months ago
- ☆22Mar 18, 2025Updated last year
- Code for: Rethinking Cross-Attention for Infrared and Visible Image Fusion☆51May 10, 2024Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 7 months ago
- The code for the DHViT☆12Mar 6, 2022Updated 4 years ago
- Multi-Sensor Place Recognition with Visual and Text Semantics☆21May 27, 2025Updated 9 months ago
- ☆24Jun 19, 2025Updated 9 months ago
- Implementation of various attention mechanisms☆16Sep 8, 2021Updated 4 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- Visible-Thermal UAV Tracking: A Large-Scale Benchmark (CVPR2022)☆111Feb 7, 2025Updated last year
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 7 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…☆30Apr 20, 2025Updated 11 months ago
- Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes☆18Aug 24, 2025Updated 6 months ago
- ☆61Jun 3, 2024Updated last year
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆114Jun 26, 2024Updated last year
- ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification☆55Dec 5, 2024Updated last year
- ☆26Oct 15, 2024Updated last year
- Calibrated and Complementary Transformer for RGB-Infrared Object Detection☆106May 9, 2024Updated last year
- ☆23Apr 12, 2023Updated 2 years ago
- Dual Contrastive Learning for Few-shot Medical Image Segmentation☆28Mar 2, 2023Updated 3 years ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆59Nov 5, 2024Updated last year
- [ICRA-2025] Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention Mechanisms☆44Nov 26, 2025Updated 3 months ago
- ☆22Dec 9, 2022Updated 3 years ago
- The official code of "A Two-stage hybrid CNN-Transformer Network for RGB Guided Indoor Depth Completion"☆12Sep 6, 2023Updated 2 years ago
- ☆12Aug 22, 2023Updated 2 years ago