iAsakiT3T / SHIFNet
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
☆11 · Updated 7 months ago
Alternatives and similar repositories for SHIFNet
Users who are interested in SHIFNet are comparing it to the repositories listed below.
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation ☆31 · Updated last month
- ☆16 · Updated 6 months ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion ☆34 · Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes ☆43 · Updated 6 months ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation ☆61 · Updated 9 months ago
- [NeurIPS2024] Official Implementation of the paper [Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Se… ☆26 · Updated 6 months ago
- ☆21 · Updated 3 months ago
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning ☆45 · Updated 3 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation ☆16 · Updated last month
- Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation ☆21 · Updated 3 months ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G… ☆46 · Updated 4 months ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation" ☆37 · Updated this week
- Code for EventDance & EventDance++ ☆10 · Updated last year
- High Quality Video Reasoning Segmentation ☆37 · Updated last month
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs ☆82 · Updated 2 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model ☆102 · Updated last year
- ☆91 · Updated last year
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation” ☆13 · Updated 10 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion ☆172 · Updated 7 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation ☆19 · Updated 11 months ago
- [ECCV'24] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation ☆38 · Updated 7 months ago
- Official repository of the CVPR 2024 paper RMem: Restricted Memory Banks Improve Video Object Segmentation ☆51 · Updated 8 months ago
- #ICCV, #MoE, #Tracking ☆15 · Updated 3 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆113 · Updated last year
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation ☆36 · Updated last year
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding ☆46 · Updated 4 months ago
- ☆19 · Updated 5 months ago
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation ☆35 · Updated 10 months ago
- ☆204 · Updated last year
- The official implementation of "Segment Anything with Multiple Modalities". ☆103 · Updated last year