iAsakiT3T / SHIFNetLinks
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
☆13Updated last month
Alternatives and similar repositories for SHIFNet
Users that are interested in SHIFNet are comparing it to the libraries listed below
Sorting:
- ☆18Updated 9 months ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated last year
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆42Updated 3 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆52Updated 9 months ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G…☆53Updated 7 months ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆36Updated 2 months ago
- ☆22Updated 7 months ago
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding☆63Updated 2 months ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆56Updated 5 months ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆64Updated last year
- [ECCV'24] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation☆39Updated 11 months ago
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆64Updated last year
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆58Updated 6 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Updated last year
- This is the GitHub repository for Data Augmentation for Saliency Prediction via Latent Diffusion paper in ECCV 2024, Milano, Italy☆14Updated last year
- Awesome video instance segmentation papers☆50Updated last month
- [ECCV 2024] Official project of CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning☆44Updated last year
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Updated 6 months ago
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation☆37Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆52Updated 11 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆87Updated 7 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆45Updated 9 months ago
- ☆92Updated last year
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking☆115Updated 8 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆109Updated last year
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆53Updated 4 months ago
- [ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment☆48Updated 3 months ago
- Code for Panoramic Semantic Segmentation☆14Updated last year
- ☆27Updated last year
- Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation☆27Updated 6 months ago