iAsakiT3T / SHIFNetLinks
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
☆10Updated 6 months ago
Alternatives and similar repositories for SHIFNet
Users that are interested in SHIFNet are comparing it to the libraries listed below
Sorting:
- ☆13Updated 5 months ago
- [NeurIPS2024] Official Implementation of the paper [Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Se…☆26Updated 6 months ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated 11 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆43Updated 5 months ago
- Code for EventDance & EventDance++☆10Updated last year
- Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation☆18Updated 2 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆113Updated last year
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆34Updated 3 weeks ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆30Updated 3 weeks ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G…☆43Updated 3 months ago
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation☆34Updated 9 months ago
- Code for Panoramic Semantic Segmentation☆13Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆50Updated 7 months ago
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆59Updated last year
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆42Updated 2 months ago
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding☆43Updated 3 months ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆13Updated 9 months ago
- ☆19Updated 3 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆37Updated 2 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆82Updated 3 months ago
- ☆17Updated last month
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆45Updated last month
- ☆91Updated last year
- ☆26Updated last year
- ICCV 2025: Frequency-Dynamic Attention Modulation for Dense Prediction☆17Updated last month
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆80Updated 5 months ago
- [RA-L 2025] CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes☆16Updated 5 months ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆28Updated 2 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆29Updated 5 months ago
- [ECCV'24] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation☆38Updated 7 months ago