iAsakiT3T / SHIFNetLinks
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
☆13Updated 2 months ago
Alternatives and similar repositories for SHIFNet
Users that are interested in SHIFNet are comparing it to the libraries listed below
Sorting:
- ☆17Updated 10 months ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆64Updated last year
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding☆65Updated 2 months ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆37Updated 3 months ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆52Updated last year
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆58Updated 6 months ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G…☆55Updated 8 months ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆42Updated 4 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Updated 7 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆88Updated 8 months ago
- Awesome video instance segmentation papers☆50Updated last month
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆63Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Updated 10 months ago
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆61Updated 7 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Updated 8 months ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆53Updated 4 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆45Updated 10 months ago
- [ECCV'24] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation☆40Updated 11 months ago
- [ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment☆51Updated 3 months ago
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking☆115Updated 8 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆115Updated last year
- ☆92Updated last year
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation☆39Updated last year
- ☆22Updated 7 months ago
- Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation☆28Updated 7 months ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆40Updated 3 months ago
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation☆37Updated last year
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Updated 5 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Updated last year