ClaudiaCuttano / SAMWISELinks
[CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆326Updated last week
Alternatives and similar repositories for SAMWISE
Users that are interested in SAMWISE are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆410Updated last week
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆117Updated 2 months ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation☆92Updated 2 weeks ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆161Updated 2 weeks ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆412Updated 3 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆72Updated 3 weeks ago
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro…☆108Updated this week
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆360Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆77Updated 3 weeks ago
- ☆28Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆474Updated 6 months ago
- Muggled SAM: Segmentation without the magic☆161Updated 3 weeks ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆60Updated 7 months ago
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning☆245Updated last week
- Scaling Vision Pre-Training to 4K Resolution☆205Updated last month
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆484Updated 3 weeks ago
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆278Updated 3 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆260Updated 11 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆211Updated 6 months ago
- [Fully open] [Encoder-free MLLM] Vision as LoRA☆339Updated 3 months ago
- ☆128Updated last year
- The Missing Point in Vision Transformers for Universal Image Segmentation☆51Updated 4 months ago
- ☆78Updated 5 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆87Updated 3 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆126Updated 3 months ago
- ☆81Updated 6 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆109Updated 2 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆261Updated 5 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Updated last year
- Efficient Track Anything☆640Updated 9 months ago