jyLin8100 / GenSAM
Code for AAAI 2024 paper: Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects
☆149Updated 3 months ago
Alternatives and similar repositories for GenSAM
Users who are interested in GenSAM are comparing it to the repositories listed below
- Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)☆184Updated 6 months ago
- [ICCV 2023] BoxSnake official repository.☆64Updated last year
- ☆192Updated 4 months ago
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection☆92Updated 11 months ago
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation☆122Updated 2 years ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.☆110Updated 2 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆156Updated 8 months ago
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆59Updated 6 months ago
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization☆568Updated last year
- ☆142Updated last year
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…☆170Updated 10 months ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation☆65Updated 3 months ago
- SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree☆476Updated last month
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model☆132Updated 2 months ago
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?☆187Updated last year
- ☆76Updated last year
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training☆92Updated last year
- ☆93Updated 11 months ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".